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CHAPTER 1 
INTRODUCTION 



1.1 SCOPE OF MANUAL 

The UDA50 Service Manual describes the maintenance and troubleshooting procedures needed to support 
the UDA50 Disk Controller. The manual covers both UDA50-resident diagnostic and UDA50 host-resident 
diagnostic operating procedures. The service manuals for individual disk products provide device-specific 
service information for troubleshooting disk subsystem problems. 

1.2 UDA50 MAINTENANCE PHILOSOPHY 

The maintenance philosophy planned for the UDA50 Disk Controller is module replacement. Field Service 
personnel should not attempt to replace or repair component parts within these modules. 

1.3 UDA50 FIELD REPLACEABLE PARTS 

The UDA50 Disk Controller consists of two hex modules, two flat ribbon intermodule cables, an unshielded 
standard disk interface (SDI) cable assembly, an I/O bulkhead assembly, and assorted hardware. Figure 1-1 
illustrates the major field replaceable units (FRUs) in a UDA50 assembly. 
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Figure 1-1 UDA50 Illustrated Parts 

1.4 UDA50 MAINTENANCE FEATURES 

The UDA50 Disk Controller has the following maintenance features: 

• UDA50-resident diagnostics 

• UDA50 LED maintenance displays 

• UDA50 host-resident diagnostics 
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The UDA50-resident diagnostic is a PROM-based microcode program that performs UDA50 self diagnosis 
upon powerup or hard initialization. 

A UDA50 maintenance display is located on each UDA50 module. Each display consists of four LEDs. 
These LEDs display current resident diagnostic activity and error codes caused by malfunctions. Figures 
1-2 and 1-3 show the location of the maintenance LEDs on each module. 
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Figure 1-2 Diagnostic LED Locations on UDA50 Module M7486 
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Figure 1-3 Diagnostic LED Locations on UDA50 Module M7485 
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The UDA50 host-resident diagnostics contain four tests that isolate subsystem faults to the UNIBUS or disk 
drives. A system exerciser program is also provided to test the performance of the entire disk subsystem. 

1.5 UDA50 ADDRESS SWITCHES AND JUMPERS 

The UDA50 Disk Controller contains two registers visible in the UNIBUS I/O page. They are the initializing 
and polling (IP) register and the status and address (SA) register. The IP and SA registers are assigned an 
octal UNIBUS address of 772150 and 772152, respectively. 

The UNIBUS address selector switches and a jumper plug (W13) are used to set the UNIBUS address for 
the IP register. The location of these switches and jumper plug on UDA50 module M7485 is shown in Figure 
1-4. Set the UNIBUS address switches and jumpers to the positions shown in Figure 1-5 to select UNIBUS 
address 772150. If 772150 (default address shipped with the UDA) cannot be used, alternate addresses are 
760334 and 760340. 
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Figure 1-4 M7485 Address Switch and Jumper Locations 

1-4 



UNIBUS 

ADDRESS 

BITS 17 16 15 


14 13 12 


11 10 9 


8 7 6 


5 4 3 


2 1 


OCTAL 

CODE 7 


7 


2 


1 


5 





BINARY 

CODE 1 1 1 


1 1 1 


1 


1 


1 1 





UDA50 

SWITCH 1 1 1 

SETTING I 


1 1 S10 

I ON 


S9 S8 S7 

OFF ON OFF 


S6 S5 S4 

OFF OFF ON 


S3 S2 S1 

ON OFF ON 


W13 

T1.T2 I I 




{ i 








ALWAYS 
ZEROS 


i 
ALWAYS ONES 



CX-262A 



Figure 1-5 UDA50 Switch Setting for Address 772150 



NOTE 
The UNIBUS address switches and jumpers should be 
set for a floating address when a second UDA50 is in- 
stalled on a system. Check the system configuration and 
UNIBUS addresses of all devices already installed. 
Common floating addresses are 760340 and 760330. 

In past disk products, a vector address was also physically selectable. This is not true with the UDA50 Disk 
Controller. A vector address, typically 154 (octal), will be supplied by the software. 

1.5.1 UNIBUS Tuning 

A UNIBUS system may experience data late conditions that can be remedied by tuning the UNIBUS. This 
process involves changing the relative positions of the nonprocessor request (NPR) devices on the bus. The 
device at the front of the bus (near the host) will have the highest priority. The device at the end of the bus 
will have the lowest priority. 

1.5.1.1 UNIBUS Device Positions - The UDA should be placed at the end of the UNIBUS (lowest NPR 
priority) because it is heavily buffered. Other NPR devices should be placed along the UNIBUS depending 
on their buffering. The NPR devices with the least amount of buffering should be placed at the front of the 
UNIBUS. 

1.5.1.2 UDA NPR Priority Jumper - A jumper has been inserted on the M7485 module to help tune 
the UNIBUS system. The jumper changes the average number of UDA NPR requests over a given amount 
of time by delaying the request for 0, 6.2, or 10 microseconds. Table 1-1 shows the amount of delay and 
jumper configuration. 
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Table 1-1 UNIBUS Delay 



Amount of Delay Jumper Configuration 



microseconds T4-T6 

6.2 microseconds T5-T6 

10 microseconds T6-T7 



If data late conditions are observed after setting the delay jumper to the 6.2 microseconds position, the jumper can 
be set to the 10 microseconds position (T6-T7). 

On some systems it will not be possible to remedy data late errors by changing the UDA NPR Priority Jumper. 
The following is a list of systems that cannot use a UDA along with rules on how many UDAs can be installed 
on a system: 

• The UDA/RK07/DMR1 1 configuration (on an 1 1/70 only) gives data late errors from the RK07 
regardless of the UDA's jumper setting. Either an RK07 or a UDA, but not both, can be con- 
figured on the 11/70 when a 1 megabit per second DMR11 is present. 

• On both PDP-11 and VAX systems, no more than two UDAs may be installed on a UNIBUS 
with nonbuffered UNIBUS peripheral devices. 

NOTE 
If a bus repeater is used, a greater possibility of data 
late errors exists. In general, the longer the UNIBUS, 
the greater the possibility of data late errors. 

1.5.1.3 UDA Burst Parameter - The UDA burst parameter is a host software value that indicates how 
many long words (32 bits) the UDA will attempt to transfer when it accesses the UNIBUS. The default for 
this parameter is 1, but can range from 1 to 32. 

Increasing the UDA burst parameter to a number greater than 1 increases the overall system efficiency. 
However, data late conditions are more likely to exist. 

1.6 UDA50 PRIORITY PLUG 

All UDA50 M7485 modules are shipped with a level 5 priority plug. Because this is the recommended priority 
level for UDA50 disk subsystems, the priority plug need not be changed for the majority of installations. 
If another priority level is required in some special circumstance, the current priority plug must be removed 
and the new one inserted. The location of the priority plug is shown in Figure 1-4. It should be inserted 
so the notch on the priority plug aligns with the hole on the module socket. 

1.7 INSTALLATION OF BOOTSTRAP ROM 

The proper bootstrap ROMs are shipped with the UDA50. Bootstrap ROM 23-767 A9-00 must be installed 
on the PDP-1 1 bootstrap ROM module M9312. Bootstrap ROM 23-990A9-00 must be installed on the VAX 
11/750. 
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1.8 RELATED DOCUMENTATION 

DIGITAL customers can order the following list of UDA50 related manuals: 

UDA50 USER GUIDE (EK-UDA50-UG) 
UDA50 SERVICE MANUAL (EK-UDA50-SV) 
UDA50 MAINTENANCE GUIDE (AA-M185A-TC) 
UDA50 FIELD MAINTENANCE PRINT SET (MP-01331) 
DSA CONTROLLER DOCUMENTATION KIT (QP906-GZ)* 
DSA DRIVES DOCUMENTATION KIT (QP907-GZ)* 

* The DSA Controller kit consists of a small looseleaf binder and the maintenance guides for all the 
DSA controllers. The DSA Drives Kit consists of the two small binders containing the current maintenance 
guides for disks that operate on the DSA controllers. 

Employees: The User Guide and Service Manual can be ordered directly from Publication 

and Circulation Services, 10 Forbes Road, Northboro, Massachusetts 01532 (RCS 
Code: NR12, Mail Code: NR03/W3). 

The Maintenance Guide, Maintenance Guide Looseleaf Binder, Maintenance 
Documentation Kit, and the Field Maintenance Print Set can be ordered direct- 
ly from the Software Distribution Center, 444 Whitney Street, Northboro, 
Massachusetts 01532 (RCS Code: MSDC, Mail Code: NR02-1/J6). 

Non-Employees: The above documents can.be ordered directly from Digital Equipment Corpora- 
tion, P.O. Box CS2008, Nashua, New Hampshire 03061, or by calling toll 
free: 800-258-1710. 

Outside the United States, consult local DIGITAL offices. 
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CHAPTER 2 
UDA50 FAULT ISOLATION 



2.1 UDA50-RESIDENT DIAGNOSTICS 

There are two ways of obtaining resident diagnostic information from the UDA50 Disk Controller. The first 
is through the UDA50 LED error codes. The second is by examining the contents of the UDA50 status/address 
(SA) register. The SA register contents are also supplied to the host CPU for error logs and diagnostic error 
reports. 

2.1.1 UDA50 LED Error Codes 

Table 2-1 lists the LED error codes and which FRU is most likely at fault. 

Table 2-1 LED Error and Symptom Codes 

Most Likely Failure 



Undefined 

M7485 or 
software 

M7485 or 
software 

M7485 or 

host 

inactive 



M7485 


M7486 


Error Symptoms 


LEDs 


LEDs 




842 1 


842 1 




000 1 


X X X X 


Hex 1; undefined 


00 10 


0000 


Hex 2; microcode stuck in 
init step 2 


00 11 


0000 


Hex 3; microcode stuck in 
init step 3 


100 


0000 


Hex 4; microcode stuck in 
init step 4 or UNIBUS 
timeout error 



B 

L 
101 

N 
K 

110 

X X X X 

111 

X X X X 

1000 



100 1 
0000 



0000 



X X X X 

110 

X X X X 

111 
0000 



0000 
10 1 



Hex 4/5; test complete 

UDA50 communicating with host software 



Hex 6; undefined 
Hex 7; undefined 



Hex 8; wrap bit 14 set in 
SA register 

Hex 9; board one error 



No problem 



Undefined 



Undefined 



M7485 or 
software 

M7485 
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Table 2-1 LED Error and Symptom Codes (Cont.) 



M7485 
LEDs 
842 1 



M7486 
LEDs 

842 1 



Error Symptoms 



Most Likely Failure 



10 10 
10 10 


0000 
10 10 


Hex A; board two error 


10 11 

X X X X 


X X X X 

10 11 


Hex B; undefined 


X X X X 

1100 


1100 

X X X X 


Hex C; Timeout error, 
check error code in SA re: 


110 1 

X X X X 


X X X X 

110 1 


Hex D; RAM parity error 


1110 

X X X X 


X X X X 

1110 


Hex E; ROM parity error 


1111 


1111 


Hex F; sequencer error 


Cycling 
pattern 


Cycling 
pattern 


None 


Cycling 
pattern 


Cycling 
pattern 


The cycling pattern contini 
of the host software initial 



UDA50 is not responding to the host CPU. 



M7486 



Undefined 



Many 
causes 

M7486 



M7485 

M7485 
No problem 

M7485 



* The LEDs normally cycle while the UDA50 is waiting for the host to start the initialization process. At that time, 
it responds to the initialization and the cycling pattern stops. This normally occurs in about two seconds. 

Note: 1 = LED ON = LED OFF x = May be ON or OFF 

When two codes are given for the same error, both indicate the same failure. 

2.1.2 Status/ Address Register Error Codes 

More detailed information on UDA50 functional and diagnostic error codes is reported through the SA register. 
The contents of this register may be examined manually through the CPU console at the UDA50 UNIBUS 
address plus 2. This address is normally 772152. Table 2-2 lists the SA error codes and indicates the most 
likely FRU at fault. 
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Table 2-2 SA Register Error Codes 



Error 
Code 
(Octal) 



Error Description 



Most 
Likely 
FRU 
Failed 



100001 UNIBUS packet read error 

100002 UNIBUS packet write error 

100003 UDA ROM or RAM parity error 

100004 UDA RAM parity error 

100005 UDA ROM parity error 

100006 UNIBUS ring read error 

100007 UNIBUS ring write error 

100010 UNIBUS interrupt master failure 

100011 Host access timeout error 

100012 Host exceeded command limit 

100013 UDA SI hardware fatal error 

100014 DM XFC fatal error 

100015 Hardware timeout of instruction loop 

100016 Invalid virtual circuit identifier 

100017 Interrupt write error on UNIBUS 
104000 Fatal sequence error 

104040 D processor ALU 

104041 D processor control ROM parity error 

105102 D processor with no BD # 2, or RAM parity error 

105105 D processor RAM buffer error 

105152 D processor SDI error 

105153 D processor write mode wrap serdes error 

105154 D processor read mode serdes, RSGEN & ECC error 

106040 U processor ALU error 

106041 U processor control register error 



M7485* 

M7485* 

M7485 or M7486 

M7486 

M7485 

M7485* 

M7485* 

M7485 

M7485* 

M7485* 

M7486 

M7486 

M7485* 

M7485* 

M7485* 

M7485 

M7485 

M7485 

M7486 

M7486 

M7486 

M7486 

M7486 

M7485 

M7485 



* Possibly the host CPU is at fault. 
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Table 2-2 SA Register Error Codes (Cont.) 



Error 
Code 
(Octal) 



Error Description 



Most 
Likely 
FRU 
Failed 



106042 U processor DFAIL/control ROM parity /BD # 1 
test CNT 

106047 U processor constant PROM error with D processor 
running SDI test 

106055 Unexpected trap found, abort diagnostic 

106071 U processor constant PROM error 

106072 U processor control ROM parity error 
106200 Step 1 data error (MSB not set) or RE-INIT 
107103 U processor RAM parity error 

107107 U processor RAM buffer error 

107115 Test count was wrong (BD _ 2) 

112300 Step 2 error 

122240 NPR error 

122300 Step 3 error 

142300 Step 4 error 



M7485 

M7485 

M7485 
M7485 
M7485 
M7485 
M7486 
M7486 
M7486 
M7485 
M7485 
M7485 
M7485 



* Possibly the host CPU is at fault. 

2.2 HOST-RESIDENT DIAGNOSTICS 

A different approach must be taken to isolate faults if no LED error code exists in the UDA50. The first 
step is to examine the customer error log to determine the source of the problem. The next step is to cycle 
the power off and back on, resetting much of the logic. The UDA50 then starts the initialization routine. 
Only the first phase will run as completion takes host software interaction. If the first phase passes, the next 
step is to run the available host-resident diagnostics. 

Host-resident diagnostics available to diagnose UDA50 problems are the PDP-11 CZUDC and the VAX 
EVRLA. Both diagnostic programs have the same tests and error messages with the exception of the pro- 
gram name (CZUDC or EVRLA). The programs consist of the following four tests: 



• Test 1 

• Test 2 

• Test 3 



UNIBUS interrupt/address test (checks out UDA50 functionality). 
Disk-resident diagnostic test (runs the drive resident diagnostics). 
Disk function test (performs minimum drive functional tests). 
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• Test 4: Disk exerciser test (performs a limited read and write test in the diagnostic cylinder 
area - the diagnostic will not write in the customer data area unless specifically instructed to do 
so, and a warning message will be printed). There are two modes of operation for test 4. 

1 . Default operation on the diagnostic cylinder or customer area with all parameters selected 
by the default answers shown below. 

2. Manual intervention to the test using new parameters that may include the customer data 
area. This manual intervention is referred to as a fifth test in test printouts. 

2.2.1 PDP-11 Subsystem Diagnostics Preparation 

The PDP-1 1 subsystem diagnostics CZUDC (UDA50 host-resident diagnostics) run on any disk drive cabled 
to the UDA50. The MSCP and SDI bus structure compensates for differences between disk drives. Also 
included with the PDP-1 1 subsystem diagnostic is a disk formatter program (CZUDE). The formatter (CZUDE) 
is not a diagnostic; do not run it unless specifically instructed. 

The CZUDC diagnostic program asks both hardware and software questions of the user. A sample printout 
of these questions, when the default conditions are selected, is shown below: 



l. '...• a. n i... j. 1 a. n -_3 



The remaining software questions apply to test 4 only. 



If the manual intervention question is answered yes, the following series of questions will be asked. Manual 
intervention should be used only to further isolate problems after running the test with the default answers 
shown above. 

THE FOLLOWING QUEST IONS PEFEP TO UNIT XA UDA AT NN' :: :N;K; :: ! DPIUE ! : \N> : ' 

BAD BLOCK <A> " : ' 
) yqij ijAHT TO CHANGE TESTING PARAMETERS FOP THIS URIUE >., L .:■ N '"' 
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Answer this question "N" to bypass further questioning. A "Y" answer results in the following questions. 



If answered 1 : 



.Jf CLijil'i/Li'lU 



The last two questions will be asked one to four times, depending on the answer to the previous question. 
If answered 2: 



The last question may be asked one to seven times, depending on the answer to the previous question. 
If answered 3: 



The last question may be asked one to seven times, depending on the answer to the previous question. 
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The following question is asked only if options 2 or 3 were requested: 



The following question will be asked if the LIMIT THE CYLINDERS question was answered Y (within 
options 2 and 3) or if option 4 was selected for the area to test: 



The following questions will be asked if data pattern 16 was selected: 



2.2.2 VAX Subsystem Diagnostics Preparation 

The VAX subsystem diagnostics consist of two programs: ZZ-EVRLA (UDA host-resident diagnostic) and 
ZZ-EVRLC (SDI generic disk exerciser). Also included with the diagnostic kit for the VAX subsystem is 
a disk formatter program (ZZ-EVRLB). The formatter (EVRLB) is not a diagnostic; do not run unless specifi- 
cally instructed. 

ZZ-EVRLC tests the read and write ability of any SDI disk drive and displays differences in the read and 
write data to the operator. 

Before running the subsystem EVRLA diagnostic tests, the system must be set up and tested under the diagnostic 
supervisor. On a VAX, this requires a DW UNIBUS adapter, UDA50 disk controller, and an RAnn disk 
drive to be attached. The following printout shows how to attach these devices: 
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NOTE 
An RA60 must be attached by using the following 
sequence: 



I; E U I C E H ft M E ? B J ft 3 Note the " J " instead of the " U " . 

This is how VMS identifies the drive 
as removable. 

2.3 RUNNING THE HOST-RESIDENT DIAGNOSTICS 

Host-resident diagnostic CZUDC or EVRLA contains tests 1 through 4 linked together to run automatically 
in sequence. However, if an attempt to read or write on the customer data area is desired during test 4, 
manual intervention is necessary = 

A detailed description of these diagnostics is available on microfiche under RA80 diagnostics (CZUDC). 

If a printout of the test progress is wanted, type SET TRACE on a VAX prior to starting the test. 

The following is a sample test printout: 



DS> S ! Af 



r'KUbKH! 



U i' H O '€• 



-J i ! H 



;ts A" 



i3 : 91 



Sl -.JHN' 



SUE IE 



JBTEST 






UiJH HliLiKLi! 

UBA BIhGNO 



J\J~ I'M I i! 



.'A.' i !__. : 



] D A INTER R U F T 3 U E ' EST 
4 UBA I r-l I T I AL I ZAT I ON - SM R I NG B 1 



JBA INITIAL IZA 



3N - lG RING BUFFER SUh i E5 



'J.'H i.'! i rKuuP.ri'l H i." L> i\l_ ■_•■_= A : ■ 



ilj. 



I BENT 
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The disk exerciser diagnostic in test 4 will continue to run until halted with a CTRL C . Type CTRL C to 
return to the diagnostic supervisor prompt (DS>), then type ABORT. If test 1 is successful the problem 
is probably drive related. Tests 2 through 4 should detect the failure. 

2.4 INTERPRETING HOST-RESIDENT DIAGNOSTIC MESSAGES 

The VAX and PDP-11 diagnostics display the same error messages. Error messages take on three distinct 
formats which provide SA register contents, data comparison error information, or real-time drive state and 
status. Consult the drive service manual or maintenance guide for interpretation of the status messages. 



2.4.1 Status/Address (SA) Register Contents 

The following sample error message gives the UDA50 SA register contents: 



For a description of the error and a callout of the most likely faulty FRU, find the SA register contents (100004) 
in Table 2-2. 

2.4.2 Data Comparison Errors 

The UDA can be put into a mode where the UDASA acts as a wrap port. In this mode, any data being sent 
to the UDASA will be displayed within a small period of time. If the data in the UDASA does not match 
the data sent to it, the following error message is displayed. In some instances, a VAX error message prints 
the name of the failing FRU. 



2.4.3 Real-Time Drive State And Status 

In the following sample error message, the last two lines contain the real-time drive state as supplied by 
the drive and a status message: 



:■*' T kif; Ujt : '!' 7T 



The real-time drive state (RTDS) message consists of four hexadecimal digits. Only four state bits within 
these hexadecimal digits are of diagnostic value to the field service engineer. The rest of the bits are too 
transitory and are masked out before the RTDS message is printed. The following are the four important 
state bits: 



• Read/ write ready <R.--N RDY) 

• Drive available <AUhIL> 

• Attention (ATTN) 

• Receiver ready ^RCURRDY) 
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The location of these four state bits within the hexadecimal code is shown in Figure 2- 1 . The interpretation 
of the RTDS message requires an understanding of the causes and effects of each bit. It also requires an 
understanding of drive online, drive offline, drive available and drive unavailable. Definitions of each of 
the four RTDS message bits and the online and available states follow. 
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Figure 2-1 Real-Time Drive State Bits 



The following four terms define the state of the drive as seen from the controller: 



Drive offline — The drive is not operational and may not communicate via the drive control protocol. 

Drive unavailable — The drive is operating, is visible to, and may at times communicate with 
the controller. However, the controller may not fully utilize the drive because it is online to the 
other controller. 

Drive available - The drive is visible to, capable of communicating with, and capable of executing 
an online command. However, the drive is not currently online to either controller. 



Drive online 
to the other. 



The drive is dedicated to the exclusive use of one controller and is not available 



The following paragraphs explain the causes, effects, and relationships among the four state bits within the 
RTDS message: 

• RCVR RDY (receiver ready) — When asserted, this bit indicates the drive is ready to receive 
a command on the SDI interface write/command line. RCVR RDY is negated while the drive 
is processing a command. 

• ATTN (attention) — This notifies the controller that a potentially significant status change has 
occurred in the drive. 

The drive asserts this signal in the online state whenever any of its generic status bits change. 
The following three cases are exceptions to this rule: 

1 . A generic status bit changes as a direct consequence of the correct operation of a command. 

2. A generic status bit changes as the result of an error in the reception, validation, or execu- 
tion of a command. 

3. The ' 'RE' ' status bit changes due to a transmission error outside of a command. The "RE' ' 
bit is described in byte 6 of the drive status message. 

An online drive may assert ATTN regardless of whether a command is in progress or not. The 
drive will continue to assert this signal until it receives a valid GET STATUS command from 
the controller. At this point, the drive will negate the ATTN signal. 
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• 



A spinning drive in the available state always asserts the ATTN signal. The ATTN signal is negated 
if any condition arises that would prevent the available drive from spinning up under controller 
command. 

R/W RDY (read/write ready) - This indicates the drive is capable of handling a data transfer 
to or from the disk surface. 

Upon receipt of the start frame of a command, the drive negates R/W RDY prior to reasserting 
RCVR RDY. The signal will remain negated until the drive has processed the command and has 
transmitted the end frame of the response (if required). 

Any head motion negates this signal until the operation is completed and the drive is again ready 
to perform I/O operations. 

The drive asserts R/W RDY after the successful completion of a seek operation. If the operation 
is unsuccessful, the drive will keep the R/W RDY signal negated and use ATTN to signal the 
problem. 

• AVAIL (available) — When asserted, this bit indicates the drive has entered the drive available 
state. The signal is negated when the drive leaves the available state. 

Table 2-3 describes the possible drive state codes. The error printout example 8001 indicates the signal R/W 
RDY is active and the drive is ready to receive. This represents the normal drive online state of a drive 
in a UDA50 subsystem. 

Table 2-3 Real-Time Drive State Code Interpretation 

RTDS Description 

Hex 

Code 

0000 The drive is either in initialization or in an offline state. 

0001 The drive is online. Possibly an error state was recently cleared, or the drive spun down with 
the RUN/STOP switch out. 

0002 This code indicates an invalid drive state. ATTN is asserted and the drive cannot receive con- 
troller commands with RCVR RDY negated. 

0003 The drive is online and one of two conditions exist: 

1. The disks are spinning, and there is an error state. 

2. The disks are not spinning, and there is a switch change active. 

0040 This code indicates an invalid drive state. RCVR RDY should be asserted if the drive is in the 
available state. 

0041 The drive is available but cannot be spun up. The RUN/STOP switch is not pushed in, or there 
could be an open module interlock preventing spinup. 

0042 This code indicates an invalid drive state. ATTN is asserted and the drive cannot receive con- 
troller commands with RCVR RDY negated. 

0043 The drive is available and capable of being spun up. 

8000 This code indicates an invalid drive state. R/W RDY should not be asserted with RCVR RDY 

negated. 
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Table 2-3 Real-Time Drive State Code Interpretation (Cont.) 



RTDS Description 

Hex 

Code 



8001 This is the normal drive online state. 

8002 This code indicates an invalid drive state. ATTN is asserted and RCVR RDY is negated, 
preventing the drive from receiving controller commands. 

8003 The drive is online and one of two conditions exist: 

1. One of the switches on the drive operator control panel has been pushed. 

2. The drive is reporting a successful retry of a seek with recalibration. 

8040 This code indicates an invalid drive state. R/W RDY and AVAIL should never be asserted 
together. Also, ATTN should be asserted when the drive is available and capable of being spun 
up. 

8041 This code indicates an invalid drive state. R/W RDY and AVAIL should never be asserted 
together. Also, ATTN should be asserted when the drive is available and capable of being spun 
up. 

8042 This code indicates an invalid drive state. R/W RDY and AVAI1 should never be asserted 
together. Also, ATTN is asserted and the drive cannot receive controller commands with RCVR 
RDY negated. 

8043 This code indicates an invalid drive state. R/W RDY and AVAIL should never be asserted 
together. 

FFFF The controller is unable to get a valid drive state. 



2.4.4 Status Message Bytes 

The status line information found in the error message above is the result of the diagnostic performing a 
GET STATUS command. Fourteen of fifteen possible status bytes are printed in the error message. Figure 
2-2 shows the breakdown of the fifteen status bytes. The first byte is not printed because it is a UDA50 
response code to the GET STATUS command. Bytes 9 through 15 contain drive-specific status bits. The 
drive service manual or maintenance guide should be consulted for interpretation. 

Table A-l, in the appendix, describes status bytes 1 through 8 as shown in Figure 2-2. 
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Figure 2-2 Drive Status Bytes 



2.4.5 Status Message Interpretation 

Figure 2-3 shows the breakdown of the status results from the real-time drive state and status sample error 
message. 
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Byte 15 14 

Status: 00 01 



13 12 11 10 

11 00 00 00 



Bytes 9 through 15 contain 
specific status bits. 
Consult appropriate drive diagnostic 
maintenance guide for interpretation. 
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Figure 2-3 Status Message Interpretation 
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Byte 1 is the GET STATUS response code and is not printed out. 

Byte 2 and the lower half of byte 3 comprise a three-digit hexadecimal unit number. In the exam- 
ple, the unit number is 020 (hexadecimal) or 32 (decimal). 

Byte 3 (upper half) reflects the subunit mask and indicates the drive sending the status is subunit 
zero (0001). 
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Byte 4 is the request byte and breaks down as follows: 



i 1 

I OA RR DR SR 



i 00 1 1 

I EL PS RU 



(13H) 



The RU bit is set indicating the drive RUN switch is depressed. 

The PS bit is set indicating the port select switch for the UDA requesting the status is 
depressed. (The drive is available to the UDA50.) 

The SR bit is set indicating the drive spindle is up to speed. 

The OA bit is not set indicating the drive is at a drive available state. If it had been set, 
it would indicate online. 

The RR bit is not set indicating the selected drive does not need an internal adjustment. 

The DR bit is not set indicating the selected drive has no request for an external diagnostic 
to be loaded into it. 



Byte 5 is the mode byte and breaks down as follows: 



0000 

W4 W3 W2 Wl 



110 

DD FU DB S7 



(06H) 



The DB bit is set indicating it is possible for the host to use a diagnostic cylinder on the drive. 

The FO bit is set indicating the drive can be formatted. 

No bits set in the W4 to Wl field indicate no subunit is write protected. 

The DD bit is not set indicating the drive has not been disabled by a controller due to some 
error or diagnostic routine. 

The S7 bit is not set indicating the 512 byte/ sector format is selected for the drive. 

Byte 6 is the error byte, and for this example none of the above errors have occurred. 
(DE-RE-PE-DF-WE). 

Byte 7 is the controller byte, and for this example a normal drive status is observed (C1-C4 = zeros). 
The S4 to SI bits being cleared indicate that the UDA50 is to interrupt the host CPU whenever 
any drive on the subsystem raises its available line to the UDA50. 

Byte 8 is the retry count/failure code, and for this example no retries by the diagnostic have been 
attempted. 
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2.5 SUBSYSTEM ERROR MESSAGES 

In addition to the three error messages above, the subsystem diagnostic tests may also print out one of the 
following: 

• Host error message 

• UDA initialization error message 

• Interrupt handler error message 

• Diagnostic machine error message 

A sample printout of each of these four types of subsystem error messages is given here for the EVRLA 
diagnostics. Each error message contains a number and a description of the cause of the error. 

1 . Sample printout of a host error message: 
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2. Sample printout of a UDA initialization error message: 
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3. Sample printout of an interrupt handler error message: 
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4. Sample printout of a diagnostic machine (DM) error message: 
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Each of the above subsystem error messages gives an error number in the printout. For example, in the 
DM error message sample printout, an error number of 02005 is given. A description of the error may be 
found in Table A-2 in the appendix. 
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2.6 HOST ERROR LOG EXAMINATION 

This section assumes you already know how to run the error log program or you have access to an operator 
who can run error log reports for you. Training courses are available on this topic. 

This section focuses on how to interpret error log reports you might encounter for MSCP/SDI devices. Sam- 
ple error log printouts for both VMS (SYE) and RSTS (ERRDIS) are examined. They report similar infor- 
mation but in different formats. Therefore, decoding charts are provided. Select the error log format that 
applies to your operating system. 

2.7 COMMON ERROR LOG MESSAGE DEFINITIONS 

Even though the error log printouts on different operating systems vary in appearance, there is common 
message information. Read the following definitions before examining the individual sample printouts: 

1 . Command reference number — A command reference number is given for each error log event 
message. It is the MSCP command number that caused the error message to be reported. The 
command reference number is zero if the error message is not related to an outstanding MSCP 
command. When a seek command is issued, the drive attempts a number of retries. Each retry 
for that seek command has the same command reference number. 

2. Drive number (logical unit address) — A unit message number refers to the logical unit number 
of the device. The unit number may be zero if the error message does not refer to a specific device 
or unit. 

3. Sequence number — A sequence number is assigned to an MSCP packet when it is passing infor- 
mation to the host error logger. The use of the sequence number is dependent on the MSCP server 
in the controller microcode. If the sequence number is zero, the controller does not support the 
use of this feature. 

4. Message format — A format code associated with each error log event determines the error message 
format. It is important to determine the format code first to interpret the remainder of the error 
message correctly. The format code will reveal whether the error event is reporting a controller 
error (code 0), a host memory access error (code 1), a disk transfer error (code 2), or an SDI 
error (code 3). Use Table A-3 in the appendix to decode the format of each error log event. 

5. Message Flags - The error log message flags report which of the following three operating 
conditions apply: 

• Operation successful flag — If set, this flag indicates the operation causing this error log 
message has been successfully completed. If clear, this flag indicates the operation is not 
yet successfully completed. 

• Operation continuing flag — If set, this flag indicates the retry sequence for this operation 
is continuing. If clear, the retry sequence for this operation has terminated. Ignore this flag 
status if the operation successful flag is set. If the "operation successful" and the "opera- 
tion continuing" flags are both clear, the error log message is reporting a hard (unrecoverable) 
error. 

• Sequence number reset flag — If set, this flag indicates the MSCP command sequence number 
has been reset by the MSCP server since the last error log message sent. Note, this bit 
is always set if the MSCP server does not implement the error log sequence number feature. 
If clear, this flag indicates the command sequence number has not been reset, implying 
it may be used to detect missing error log messages. Refer to A-4 in the appendix to decode 
these message flags. 
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6. Status/Event code - The status/event code identifies a specific error or event being reported by 
this error log message. Refer to A-5 in the appendix for a list of all the UDA MSCP status/event 
error log codes. 

7. Controller identification — The controller identification message provides the controller unique 
number, controller class, and the controller model. The controller unique number is blasted into 
ROM. The controller class refers to whether it is a mass storage controller or some other disk 
or tape device. Refer to A-6 in the appendix for controller class values and A-7 for the controller 
model values. 

8. Drive identification — The drive identification message contains a drive unique number blasted 
into ROM which is not necessarily the drive serial number etched on the drive S/N tag. It also 
contains a controller class number shown previously in A-6 and a drive model number given in A-8. 

9. Hardware and software revisions - These revision levels are given for both the controller and 
the drive. The values are blasted into ROM for each device. 

10. Pack or HDA serial number — This is the low order 32 bits of the serial number of the HDA 
mounted on the drive. This serial number is written on the media at the factory. The bit field 
is zero if the format of the media does not provide for a media serial number. The bit field is 
an undefined number if the media is not mounted or the serial number cannot be read. 

1 1 . Header or logical block number - This message gives the logical block number (LBN) of the 
physical sector where the error occurred. If the high four bits are 0000 (binary), then the low 
28 bits are the logical block number where the error occurred. If the high four bits are 01 10 (binary), 
the low 28 bits are the replacement block number where the error occurred. 

12. Error recovery level — The error recovery level reflects the most recent attempt at a data transfer. 
Each device has a specified number of error recovery levels that corresponds to the mechanisms 
it has available to attempt an error recovery. For example, if data cannot be read, the drive might 
try offsetting its head position slightly in case it had been altered since the data was written. For 
each such attempt, the error recovery level will be incremented. The values zero and 255 (all 
ones) indicate no special error recovery procedures are used. 

13. Error retry count — This message gives the retry count within the current error recovery level 
of the most recent transfer attempt. This value starts at one and increments for each subsequent 
attempt at the same error recovery level. It continues until a drive dependent maximum number 
is reached, then the retry count is set to one and the next error recovery level (if any) is tried. 

14. Host memory address — This message gives the host memory address being used at the time the 
error was detected. For a UDA50 on a RSTS system, the maximum address will occupy 18 bits 
(0-17). Host memory access errors include UNIBUS parity errors (PA/PB lines) and UNIBUS 
timeouts (SSyn timeout), etc. 

15. MSCP error code — This message is printed out at the bottom of the ERRDIS error log printout 
only. It tells why the MSCP communications between the host and controller failed. Refer to Table 
A-9 in the appendix for a list of the MSCP error codes. 

16. Status code of packet — The status code of the packet is a summary statement of the condition 
which prompted the error log entry. The condition statement is obtained from an analysis of the 
status/event field. This statement is very general and does not give a detailed cause for the error. 
Refer to table A- 10 in the appendix for a list of status codes for the packet. 
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17. SDI status message - An SDI status message is received when the error log is reporting an SDI 
error event (message format 03). In a VMS/SYE error log report, this message is given in the 
form of three 8 character fields. This message contains the contents of bytes 4 through 15 of the 
drive status message shown in Figure 2-2. Bytes 4 through 8 contain controller specific informa- 
tion and bytes 9 through 15 contain drive specific information. Status byte 15 contains the same 
drive error code normally obtained through the hand-held terminal. In the RSTS ERRDIS error 
log, the SDI status information is found in words 22 through 27 of the MSCP packet. This packet 
is printed out in the form of 37 character groups. Refer to the sample printout of the ERRDIS 
error log report. 

2.8 DECODING VMS ERROR LOG REPORTS 

This section shows how to decode the information in a VAX/VMS error log report. Two sample VAX/VMS 
error log printouts are given with note numbers on the right hand side. These numbered notes after each 
sample printout show how to interpret the error log messages. 

2.8.1 Sample Printout Of VMS Error Report Sequence 1 



1! ! F 
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2.8.2 Notes For VMS Error Report Sequence 1 

• Note 1: This is the command number which caused this error to be reported. It will be zero 
if no host command is associated with this error. 

• Note 2: This drive unit number relates to the error log message. 

• Note 3: Presently unused. Will be zero. 

• Note 4: The (02) indicates the format of this message is that of a disk transfer error. Refer to 
A-3 in the appendix. 

• Note 5: The (41) indicates the sequence number reset flag (01) and the operation continuing 
flag (40) are set. Refer to A-4 in the appendix. 

• Note 6: The (00CB) identifies a specific error or event being reported by this error log message. 
The error is LOST RECEIVER READY DURING TRANSFER. Refer to A-5 in the appendix. 

• Note 7: The controller identification message decodes as shown in Figure 2-4. Use Tables A-6 
and A-7 to decode the controller class and controller model. 



00000000 




1 
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CONTROLLER ID MESSAGE 



-*• UNIQUE CNT ID = 000000000000 

-►06= MODEL UDA50 

-*• 01 = MASS STORAGE CNTRL CLASS 
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Figure 2-4 Decode of Controller Identification Message 
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Note 8: This is the microcode version from the UDA. Listed below are the possible codes: 

» 02 = OLD - UDA needs upgrade. 

• 03 = Y09 microcode. 

• 04 = U21 microcode. 

Note 9: Not used. Will be zero. 

Note 10: For a UDA50 the LSB is the port number (0-3). All other bits are zero. 

Note 1 1 : The unit identification message gives the drive unique device number (serial number 
for RA80/RA81 but a unique number for RA60), the device class, and the drive model. Figure 
2-5 shows how to decode the unit identification message. Refer to Tables A-6 and A-8 for the 
controller class and drive model. 
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*■ UNIQUE IDENTIFIER = 000000000482 

04= DRIVE MODEL RA60 
*■ 02= DISK CLASS DEVICE 
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Figure 2-5 Decode of Unit Identification Message 



Note 12: This message indicates the error recovery procedure used. Codes of zero and 255 in- 
dicate no special error recovery is used. 

Note 13: This message indicates the number of retries attempted under the error recovery pro- 
cedure given in note 1 1 . 

Note 14: The volume serial number gives the serial number of the disk media. 

Note 15: The header message gives the physical location where the error occured. If the MSB 
is a 0, the header is from LBN space. If the MSB is a B, the header is from RBN space. 
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2.8.3 Sample Printout Of VMS Error Report Sequence 2 
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2.8.4 Notes for VMS Error Report Sequence 2 

• Note 1: The command reference number for error sequence 2 is the same as error sequence 
1. This means both error events are related to the same MSCP command. 

• Note 2: This error report has a message format code of (03). A-3 in the appendix shows code 

03 is an SDI error report message format. The last message on this report (MSLG $Z ) gives 

three lines of SDI error code information interpreted in note 4. 

• Note 3: The MSLG $W EVENT message is reporting a hexadecimal status/event code of 

(00EB). Refer to A-5 in the appendix. A-5 shows a drive detected error. This information is also 
given in the verbal description on the right side of the VMS error log printout. It is important 
to learn how to check these status/event codes because not all error log reports may give this English 
interpretation on the printout. 

• Note 4: The MSLG $Z SDI message gives valuable controller-specific and drive-specific 

troubleshooting information. This information comes in three lines of hexadecimal characters shown 
in Figure 2-6, the Decoding VAX/ VMS Error Report SDI Message. 

Figure 2-6 shows the information contained in each byte of the VAX/ VMS error report SDI message. Each 
byte contains two hexadecimal characters that must be decoded further. In this example, there is controller 
specific information available in bytes 4 through 8. Use the bit map on the right side of Figure 2-6 for inter- 
pretation. Then use Table A-l to decode the meaning of each controller bit mnemonic. 



MSLGSZ-SDI MESSAGE 



BYTE 7 
BYTE 11 
BYTE 15 



4 4 


1 B 


3 3 E D 





A 1 4 





3 A 



BYTE 4 
BYTE 8 
BYTE 12 



BYTE 4 THRU 8 CONTAIN 
CONTROLLER SPECIFIC INFORMATION 
REFER TO TABLE A-1 





-MSB 


> 


r 


LSB 


BYTE 4 




1R 




BYTE 5 


04 


BYTE 6 


40 


BYTE 7 


00 


BYTE 8 


00 



MSB 














LSB 


OA 


RR 


DR 


SR 


EL 





PS 


RU 


W4 


W3 


W2 


W1 


DD 


FO 


DB 


S7 


DE 


RE 


PE 


DF 


WE 











S4 


S3 


S2 


S1 


C1 


C2 


C3 


C4 


— RETRY COUNT/FAIL CODE — 



BYTES 9 THRU 15 CONTAIN DRIVE SPECIFIC INFORMATION, MESSAGE 
CONTENT IS DIFFERENT FOR RA60 THAN FOR RA80/RA81. SEE THE 
DECODE CHART BELOW. 



RA80/RA81 



RA60 



BYTE 9 


— ED 


BYTE 10 


— 03 


BYTE 11 


— 03 


BYTE 12 


— 3A 


BYTE 13 


— 00 


BYTE 14 


— 04 


BYTE 15 


— A1 



■LAST POSITION CMD 

-SDI ERROR STATUS 

■CURR CYL(LO) 

■CURRCYL(HI) 

-CURR (GROUP) 

MICROPROCESSOR LEDS- 
-FRT PANEL FAULT CODE- 



•PREVCYCL(LO) 
■PREVCYL (Hl)- 

•PREV HEAD 

CURR CYL (LO)- 
■CURR CYL (Hl)- 

■CURR HEAD 

■ ERROR CODE — 



NOTE: ALL CYLINDER REFERENCES IN THE ABOVE DRIVE CHARTS ARE TO 
PHYSICAL CYLINDERS. 
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Figure 2-6 Decoding VAX/VMS Error Report SDI Message 
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Figure 2-6 also shows the drive- specific information contained in bytes 9 through 15. Determine which model 
drive unit is reporting this error message before you decode the extended status area information found in 
bytes 9 through 15. 

2.9 DECODING RSTS/E ERROR LOGS 

The RSTS/E ERRDIS error log report is not as sophisticated as the VMS error log report. Instead of giving 
separate error log messages with verbal descriptions, the RSTS/E error log prints out the contents of an 
MSCP packet that must be decoded. 

For the following example, it is easier to convert the MSCP messages from their octal format to hexadecimal 
format. 

This section shows how to decode the information in a RSTS/E ERRDIS error log report. A sample RSTS/E 
error log report is given with note numbers on the right-hand side. Refer to the numbered notes following 
the sample printout to interpret the error log message. 

2.9.1 Sample Printout Of RSTS/E Error Log Report 



* Si * * Si Si Si Si Si Si Si Si Si Si Si Si S Si Si S Si Si Si Si Si .Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si Si S Si Si i 



Si Si Si Si Si Si S* '^ Si ^ "Si* ■& Si "^ '^ ' "^ ^ "^* *^* Si Si Si Si *&• Si Si Si Si Si "^ Si "^ "^* S" S "^ Si ^ ^ ■=£■ Si Si Si Si Si Si '& ■ ^ ■& ■ "^ "^ **&■ Si Si S" S S* ^ 



? Si Si Si Si Si Si Si Si : 






\r\ npr 



y y y y >' y 


yyy020 






[NOTE 2 3 


004000 


1 4yii34 


000002 


y y h y\ ki k"; 




R4H4H3 


H H H H !~i .-■ 


HHfiFiHH 


000000 




000000 


000436 


000004 


000000 




hhhh: 3 ;^ 


HUH HUH 


HHHHH0 


Fifl] fifi4 


r f.j n t r ==; ~\ 


107001 


HHHHHH 


fiMH4fiH 


y y o ^ 




00000 


000000 


002023 


y y y h h h 




yj 1 p u + i-ii yj 


i-u i-i n -j -j cr 
I'!' Kj Kf ''~ ! £.' -J 


1-4 1- -. .- en- t 

KiKi j. n.J.j 


000000 




000000 


000000 








FT T! 


p t i. } r p p s r • p 






r hmte" 3 ] 
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2.9.2 Notes For RSTS/E Error Log Report 

• Note 1 : This line of the sample error log printout reads RA80 for all RA80, RA81, and RA60 
drive errors. 

• Note 2: Note the last two digits of the second MSCP envelope word (000020). If they read 20, 
as shown here, the following MSCP packet contains error information. If the last two digits read 
01, the MSCP message is an end packet and contains limited useful information. Do not use this 
document when the envelope word indicates this is an end packet. 

• Note 3 : The status code of the packet message is reporting the specific error or event that causes 
this error log report. If a coded message is given instead of a verbal description (for example, 
drive error), refer to Table A-5 in the appendix to interpret the message. 

• Note 4: The MSCP error code reports conditions that cause a failure in MSCP communications. 
Refer to A-9 in the appendix for the list of possible causes. 

• Note 5: If the second word of the MSCP envelope ends in 20 (note 2), the 30 octal words in 
the MSCP packet contain useful error information. Figure 2-7 shows how the MSCP packet message 
is organized into words. Word 4 should be decoded to determine the error message format before 
the error information is decoded. The next paragraph shows you how to determine the message 
format. 



1SCP PACKET 



004000 


140234 


000002 


000000 


040403 


000053 


000000 


000000 


000000 


000406 


000004 


000000 


000035 


000000 


000000 


001004 


107001 


000000 


000400 


000004 


000000 


000000 


002023 


000000 


013400 


000005 


001653 


000000 


000000 


000000 


TWO WORDS 
ALWAYS ZERO 



WORDS 0-3 
WORDS 4- 7 
WORDS8- 11 
WORDS 12- 15 
WORDS16- 19 
WORDS 20 -23 
WORDS 24 -27 
WORDS 28 -29 

CX-081B 



Figure 2-7 MSCP Packet Word Organization 
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2.9.3 Decoding MSCP Packet Message Format 

The error information in the MSCP message packet may come in one of four kinds of formats. It is important 
to determine the format of the message first. The four error message formats are: 



Controller error format (code 0) 
Host memory access error format (code 1) 
Disk transfer error format (code 2) 
• SDI error format (code 3) 

The error message format codes are found in the low byte of word 4 of the MSCP packet. Figure 2-8 shows 
how to interpret the format code from word 4 in the sample RSTS/E error log printout given previously. 
This procedure involves converting the octal coded message into its hexadecimal equivalent. Remember the 
message only provides useful information if the second word of the MSCP envelope equals 20 (hex). 



WORD 4 = 4 4 3 OCTAL 



i 



4 4 3 


100 000 1 


00 000 011 


HIGH BYTE 


LOW BYTE 


0100 0001 


0000 0011 


4 1 


3 



OCTAL 

BINARY CODED OCTAL 

BINARY CODED HEX 
HEX 

LOW BYTE OF WORD 4 = HEX 03 (ERROR FORMAT) 
HIGH BYTE OF WORD 4 = HEX 41 (MESSAGE FLAG) 

THIS MSCP MESSAGE IS IN AN SDI ERROR FORMAT BECAUSE 
THE LOW BYTE OF WORD 4 CONTAINED A HEX CODE OF 03 

CX-082A 

Figure 2-8 Decoding MSCP Message Error Format 

2.9.4 Recovering Error Information In MSCP Packet 

After the error format of the MSCP packet is determined, one of the following four error format charts is 
used to interpret the rest of the message. The contents of each word may be decoded and found in the appendix 
tables. If the tables do not give the octal code, convert the octal word to its hexadecimal equivalent. 

2.9.4.1 SDI Error Format Chart - The SDI error format, as determined from the low byte of word 
4, is used by the SDI-type disk controllers to report drive detected errors and SDI communication (drive 
bus) errors. Since the controller may retry a failed command, separate error log entries are recorded for 
each attempt. Each retry for the same command has the same command reference number (words and 
1). If recovery from the error condition is unsuccessful, the controller may declare the drive inoperative 
and mark it offline. The number of retries are different for each disk drive. 

The chart in Figure 2-9 is used to interpret the MSCP packet for the SDI error format. The tables in the 
appendix show how to decode the packet words. 
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|1 


2 


3 


COMMAND REFERENCE NUMBER 


DRIVE NUMBER 
(LOGICAL UNIT 


SEQUENCE 
NUMBER 






ADDRESS) 




4 HI 


LOW 


5 


6 


7 


BYTE 


BYTE 


STATUS/EVENT 


LOW WORD OF 


MIDWORD OF 


MSG 


FORMAT 


CODE 


CONTROLLER SERIAL NUMBER 


FLAG 


CODE 


(GOOD INFO) 






8 


9 HI 


LOW 


10 HI 


LOW 


11 


HIGH WORD OF 


BYTE 


BYTE 


BYTE 


BYTE 




CONTROLLER 


CNTRL 


UDA 


UDA 


UDA 


RESERVED 


SERIAL 


CLASS 


MODEL 


HDWR 


wCODE 


(N/A) 


NUMBER 






VER 


REV 




12 
LOW WORD OF 


13 

MID WORD OF 


14 
HIGH WORD OF 


15 HI 
BYTE 


LOW 
BYTE 


DF 


^IVE SERIAL NUMB 


ER 


DRIVE 
CLASS 


DRIVE 
MODEL 


16 HI 


LOW 


17 


18 


19 


BYTE 


BYTE 




LOW WORD OF 


HIGH WORD OF 


DRIVE 


DRIVE 


RESERVED 






HARDW 


hCODE 


(N/A) 


PACK/HDA SERIAL NUMBER 


VER 


REV 








20 


21 


22 


23 


LOWWORDOF HIGH WORD OF 


UDA STAT 


JS/ERROR 


LOGICAL BLC 


DCK NUMBER 


INFORMA1 


"ION 


24 HI 


LOW 


25 


26 


27 


BYTE 
DRIVE 


BYTE 
DRIVE 


DRIVE ST/ 


a>TUS/ERROR INFO 


RMATION 


STATUS/ 


RETRY 








ERROR 


COUNT 









WORDS 



0-3 



4- 7 



8- 11 



12- 15 



16- 19 



20- 23 



24-27 
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Figure 2-9 MSCP Packet for SDI Error Format 



2.9.4.2 Disk Transfer Error Format Chart - The disk transfer error format, as determined from the 
low byte of word 4, is used by the SDI-type disk controllers to report errors that occur during a disk transfer. 
This format is generally used to report the results of a series of retries. Each retry is recorded by the error 
log program with the same command reference number. If the retries are unsuccessful, the controller may 
declare the drive inoperative and mark it offline. The chart in Figure 2-10 shows how to interpret the MSCP 
packet for the disk transfer error format. The tables in the appendix show how to decode the packet words. 
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o h 


2 


3 I 


COMMAND REFERENCE NUMBER 


DRIVE NUMBER 


SEQUENCE 








(LOGICAL UNIT 


NUMBER 








ADDRESS) 






4 HI 


LOW 


5 


6 


7 




BYTE 


BYTE 


STATUS/EVENT 
CODE 


LOW WORD OF MID WORD OF 




MSG 


FORMAT 




CONTROLLER SERIAL NUMBER 




FLAG 


CODE 


(GOOD INFO) 








8 


9 HI 


LOW 


10 HI 


LOW 


11 




HIGH WORD OF 


BYTE 


BYTE 


BYTE 


BYTE 






CONTROLLER 


CNTRL 


UDA 


UDA 


UDA 


RESERVED 




SERIAL 


CLASS 


MODEL 


HDWR 


uCODE 


(N/A) 




NUMBER 






VER 


REV 






12 


13 


14 


15 HI 


LOW 




LOW WORD OF 


MIDWORD OF 


HIGH WORD OF 


BYTE 


BYTE 








DRIVE 


DRIVE 




DRIVE SERIAL NUMB 


ER 


CLASS 


MODEL 




16 HI 
BYTE 


LOW 
BYTE 


17 HI 
BYTE 


LOW 
BYTE 
ERROR 
RECOV- 
ERY 
LEVEL 


18 
LOW WORD OF 


19 

HIGH WORD OF 




DRIVE 

HARDW 

VER 


DRIVE 
i/CODE 
REV 


ERROR 
RETRY 
COUNT 


PACK/HDA SERIAL NUMBER 
I 




20 


21 






LOW WORD OF 


MIDWORD OF 


A VARIABLE AMOUNT OF CON- 
TROLLER OR DISK DEPENDENT 




LOGICAL BLOCK NUMBER 

I 


INFORMATION. THE LENGTH OF 
THIS MESSAGE IS DEPENDENT ON 
THE LENGTH OF THE ERROR LOG 
MESSAGE SENT TO THE HOST 








SOFTWARE BY THE CONTROLLER M 


CROCODE. OFTEN NO CONTROLLER 




OR DISK DEPENDENT INFORMATIOIS 


IS PROVIDED. THIS INFORMATION 




WILL TYPICALLY NOT BE INTERPRETED BY THE ERROR LOG PROGRAMS, 




ANDWILLTHUSBE PRINTED AS A St 


ERIES OF OCTAL VALUES. 





WORDS 



4-7 



8- 11 



12- 15 



16- 19 



20-23 



24-27 
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Figure 2-10 MSCP Packet for Disk Transfer Error Format 



2.9.4.3 Controller Error Format Chart - The controller error format, as determined from the low byte 
of word 4, is used by the SDI-type disk controllers to report errors that occur within the controller. The 
failing operation may be retried. The number of retries is a function of the type of error, the type of drive, 
and the type of controller. The results of each retry will be logged by the error log program with the same 
command reference number (words and 1). The chart in Figure 2-11 shows how to interpret the MSCP 
packet for the controller error format. The tables in the appendix show how to decode the packet words. 
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o h 


2 


3 


COMMAND REFERENCE NUMBER 


RESERVED 


SEQUENCE 






(N/A) 


NUMBER 


4 HI 


LOW 


5 


6 


7 


BYTE 


BYTE 


STATUS/EVENT 


LOW WORD OF MID WORD OF 






CODE 


CONTROLLER SERIAL NUMBER 


MSG 


FORMAT 






FLAG 


CODE 


(GOOD INFO) 






8 


9 HI 


LOW 


10 HI 


LOW 




HIGH WORD OF 


BYTE 


BYTE 


BYTE 


BYTE 




CONTROLLER 


CNTRL 


UDA 


UDA 


UDA 




SERIAL 


CLASS 


MODEL 


HDWR 


wCODE 




NUMBER 






VER 


REV 




A VARIABLE AMOUNT OF CONTROLLER OR DISK DEPENDENT 


INFORMATION. THE LENGTH OF THIS MESSAGE IS DEPENDENT 


ON THE LENGTH OF THE ERROR LOG MESSAGE SENT TO THE 


HOST SOFTWARE BY THE CONTROLLER MICROCODE. OFTEN NO 


CONTROLLER OR DISK DEPENDENT INFORMATION IS PROVIDED. 


THIS INFORMATION WILL TYPICALLY NOT BE INTERPRETED BY 


ERROR LOG PROGRAMS, AND WILL THUS BE PRINTED AS A 


SERIES OF OCTAL VALUES. 



WORDS 



0-3 



4-7 



8- 11 



12- 15 



16- 19 



20-23 



24-27 
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Figure 2-11 MSCP Packet for Controller Error Format 



2.9.4.4 Host Memory Access Error Format Chart - The host memory access error format, determined 
from the low byte of word 4, is used by the SDI-type disk controllers to report errors that occur while at- 
tempting to access host memory. The failing operation may be retried. The number of retries is a function 
of the controller. The results of each retry are logged by the error log program with the same command 
sequence number (words and 1). Use the chart in Figure 2-12 to interpret the MSCP packet for the host 
memory access error format. Refer to the tables in the appendix to decode the packet words. 
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1 1 


2 


3 


COMMAND REFERENCE NUMBER 


DRIVE NUMBER 


SEQUENCE 






(LOGICAL UNIT 


NUMBER 






ADDRESS) 




4 HI 


LOW 


5 


6 7 


BYTE 


BYTE 


STATUS/EVENT 


LOW WORD OF MID WORD OF 






CODE 


CONTROLLER SERIAL NUMBER 


MSG 


FORMAT 






FLAG 


CODE 


(GOOD INFO) 






8 


9 




10 HI 


LOW 


11 


HIGH WORD OF 


HI 


LOW 


BYTE 


BYTE 


RESERVED 


CONTROLLER 


BYTE 


BYTE 


UDA 


UDA 


(N/A) 


SERIAL 


CNTRL 


UDA 


HDWR 


mCODE 




NUMBER 


CLASS 


MODEL 


VER 


REV 




12 


13 




HOST MEMORY ADDRESS 




UNUSED 



WORDS 



0-3 



4-7 



8- 11 



12- 15 



16- 19 



20-23 



24-27 
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Figure 2-12 MSCP Packet for Host Memory Access Error Format 



2.9.5 Status Word Availability 

The status words contain important controller status, drive status and error information. These status words 
are only available when the error log program is reporting an SDI error format message (format code 03). 
The availability of the status words is determined by checking which error format code the error log event 

is reporting. In the VMS error log report, this is determined by reading the MLSG $B FORMAT line. 

In a RSTS/E error log report, word 4 of the MSCP packet must first be decoded as shown in Figure 2-9. 
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2.9.6 Decode Status Words 

Decode the error log report as an SDI error format type (format code 03). SDI error format reports contain 
useful information on the UDA and disk drive in the SDI status message. 

For VMS error log reports, the SDI status word information is found on the MSLG $Z SDI line. It is 

reported as three lines of hexadecimal data and shown in decoded form in Figure 2-6. 

For RSTS/E error log reports, the SDI status word information is given in words 22 through 27 of the MSCP 
packet. This information is given in an octal word format and is decoded as shown in Figures 2-7, 2-8, and 2-13. 

After the SDI status information is decoded, the table codes found in the appendix determine the failing FRU 
to replace. 



WORD 22 



«« MODE BYTE 












EQUESTBYTE - 












W4 


W3 


W2 


W1 


DD 


FO 


DB 


S7 


0A 


PR 


DR 


SR 


EL 


00 


PS 


RU 



WORD 23 



«*— CONTROLLER BYTE ^ 






ERROR BYTE 












S4 


S3 


S2 


SI 


C1 


C2 


C3 


C4 


DE 


RE 


PE 


DF 


WE 


00 


00 


00 



WORD 24 



HIGH BYTE 


LOW BYTE 


RA60= PREV. CYL (LO) 
RA80/81 = LAST POSITION 
COMMAND ISSUED 


RETRY COUNT 
FOR ALL DRIVES 



WORD 25 



HIGH BYTE 


LOW BYTE 


RA60= PREV. HEAD 
RA80/81 = CURR CYL(LO) 


RA60 = PREV. CYL. (HI) 
RA80/81 = SDI ERROR STATUS 



WORD 26 



HIGH BYTE 


LOW BYTE 


RA60= CURR CYL (HI) 
RA80/81 = CURRENT GROUP 


RA60= CURR CYL. (LO) 
RA80/81 = CURR CYL (HI) 



WORD 27 



HIGH BYTE 


LOW BYTE | 


RA60= DRIVE ERROR CODE 
RA80/81 = FRONT PANEL 
FAULT CODES 


RA60= CURRENT HEAD 
RA80/81 = MICRO PROCESSOR 
LED CODES 
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Figure 2-13 UDA and Drive Status/Error Words 22 through 27 
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2.10 DECODING RSX ERROR LOGS 

This section shows how to decode the information in an RSX-1 1 error log report. Two samples of RSX-1 1 
error log printouts are given with note numbers on the right hand side. These numbered notes show how 
to interpret the error log messages after each sample printout. 

The first example is of an SDI error format and the second is of a disk transfer error format. The controller 
error MSCP error log packet format and the host memory access MSCP error log packet format are not 
shown as examples; rather, the differences between them and the examples are pointed out. 

2.10.1 Sample Printout Of RSX-11 Error Log Report (SDI Error Format) 



ftSK- 1 iv 


1/M-rLUS 


E K K C 


R LOGGING S'\ 


STEM 


J 2 . SO 


Ti ; i _ 


mn-'-n 




'' HH : f 


- Kl ~j i-r ; V 1 


4H 4 HF 


!■: U E H 


JE3S. DU0@( 














DEUICE 


■ESShiGE 




JDLE MOTOR li 


"1 I t. K L. 


DCK EF 


KEN 


) DD-M 


■1M 


" i i ! 


Lj„! 

i' l"i 


TBFHTTFJ 


CRT I 


flN : 














Svc 


T L" hi 




TTiFNT 


~- p i"i r- p 1 


ecno 




K-j i^i P O T j. 


ji"j 





rw 



hJ .-Ci 



1. 1 M ■•■ M - P L Li S E P P R L G G I H L 



00 DD-MMM 



■i .-1 |~i .H 
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DEUICE supplied information 



UAI I IF 



INTERPRE- 
TATION 



00004808070 MESSAGE ENUELOPE FIELD 



[31 
r •? t 

[19 
[15 



24 3 CONNECTION I . D . =MSCP DISK 
20] MESSAGE TYPE == DATAGRAM MESSAGE 
ibj CREDITS = 8. 
8 3 MESSAGE LENGTH = 28. WORDS 



[NOTE 5 3 



0131300000000 COMMAND REFERENCE NUMBER FIELD 

[31 ■ 3 ERROR DOES NOT RELATE TO A COMMAND 



[NOTE 6 3 



Hj.-iHi--ii-ii.-1 



UNIT NUMBER F I ELB 
[15= 0] UNIT NUMBER = 



HHfififiH 



MESSAGE SEQUENCE NUMBER 

[ 15 = 3 ERROR PACKET IS COMPLETE 



HH3 



MESSAGE FORMAT FIELD 
*[ 7 : 3 SDI ERROR 



[NOTE 7 3 



181 



FLAGS FIELD 

[ b 3 P E R A T 1 H C N T I H U I N G 

[ 8 3 SEQUENCE NUMBER RESET 



[NOTE S3 



HHH^riS 



EUENT CODE FIELD 

*:[ 15 : 5 3 SUBCODE = DRIUE DETECTED ERROR 

*[ 4= 83 MAJOR STATUS = DRIUE ERROR 



[NOTE 9 3 



441DA52D1 



CONTROLLER I . D. (S..-H) = 441DA52D1S88 OO 



fifiA 



::OHTROLLER MODEL FIELD 
[7^ 8 3 MODEL = UDA58 



881 



CONTROLLER CLASS FIELD 

[ 7 : 83 CLASS = MASS STORAGE CONTROLLERS 



884 



C H T R L L E R S F T W A R E U E R S 1 H N U M E E R F I E L D 
[7=83 UDA58 FIRMWARE UERSION = 4 . 



[NOTE 183 



HHH 



C N T R L L E R H A R D W ARE U E R S 1 H N U M E E R F I E L D 
[ 7 : 8 3 UDA58 HARDWARE UERSION = 8 . 



HHI.Ji.-1tii.-1 



MULT I UN IT CODE FIELD 
[ 7 : 8 3 ACCESS PATH = 8. 
[15:123 SHARED SP I NDLE = O 



;ce 1000088 



hh5 



UNIT I .D. <S/H>=CC 

UNIT MODEL FIELD 
[ 7: 83 MODEL = RA81 



;E188888888 



[NOTE 113 
[NOTE 123 



2-33 



r KJ t P Y 1 8 4 ■=+ (' f: H T I N U E ii ''■> 



8 3 CLASS == EI SK 



,L Hi- 



ll CI 



fiH, 



HIT SOFTWARE UERS I OH NUMBER FIELD 
[ 7 : ] F I RMWARE UERS I OH = 7 . 



14: 



104 



UNIT HARDWARE UERS I OH NUMBER FIELD 
[7=81 HARDWARE UERS I OH = 4. 



HE 15] 



fii-Jfififii-ji 



GROUP FIELD 

[ 15 : 8 J GROUP = 8 



fifififififi' 



UOLUME SERIAL NUMBER FIELD 

[31 = 8] UOLUME SERIAL NUMBER = 7158. 



kifiTr i 



HHHflfltlMHHHH 



HEADER UALUE FIELD 
[27 = 0] LBH=0. 






NOTE: The following bytes are entered into the error log entry from information provided 

from the drive. This information is obtained from the drive via a host GET STATUS 
command. 



882813 



SDI SUPPLIED STATUS FIELD 

C 18 ] FORMAT OPERATION ENABLED 

CS ] 512 SECTOR FORMAT 

:*[ 4 3 SPINDLE HOT READY 

C 3 ] EL BIT SET 

[ 1 ] PORT SWITCH IN 

C 8 ] RUN SWITCH OH 



[h 



i U I u. i :? J 



fififi^HH 



SDI SUPPLIED STATUS FIELD 

[11= 8] CONTROLLER C FLAGS = NORMAL OPERATION 

* [ 7 3 D R I U E E R R R (. D R I U E F A U L T > 

* [ 6 3 T R A H S M I S S 1 H E R R R 



[NOTE 28! 



flHH 



RA38/81 STATUS BYTE 8 

[7=83 SEEK & RECAL RETRY COUNT = 6 



[NOTE 21 



812 



RAS8,-81 STATUS BYTE 9 [BAD;. SEE NOTE A BELOW 3 
[ 7= 83 RETRIED OP-CODE ==SET UNIT CHARACTERISTIC 



yyy 
187 



R A 88, -81 STATUS BYTE 18 
R A 88, -81 STATUS BYTE 11 
[7=83 CYL ADDR LOW BYTE? PRIHTED IN NEXT FIELD 



CHOTE 22 3 



kldfi 



RAS8,-81 STATUS BYTE 12 

[ 7= 8 3 CYL I HDER ADDRESS ==71 



881 



RA88,-S1 STATUS BYTE 13 
[ 7= 8 3 CURRENT GROUP ===== 1 
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4 3 R A 3 ,- 8 1 S TAT U S E Y 1" E 1 4 C H T E 2 3 3 

C 7 : ] LEDERRORC D E = S P I H D L E M T . INTER L C i< E R K H 

034 RR80,-Si STATUS BYTE 15 [NOTE 24 3 

[ 7 : 3 CONTROL PANEL FAULT CODE = SB I ERROR 

NOTE A: 

RSX interprets this field (RA80/81 Status Byte 9) incorrectly. The field should display the last SDI com- 
mand issued to the drive. Instead, RSX is decoding it as an MSCP Command. It should have been decoded 
SEEK (SDI opcode 012). The proper decoding for byte 9 is shown in Table 2-4. 

Table 2-4 RA80/81 Status Byte 9 Decode 



Octal 


Hex 


Command 


Octal 


Hex 


Command 


003 


03 


Diagnose 


207 


87 


Get Common 


005 


05 


Drive Clear 






Characteristics 


006 


06 


Error Recovery 


210 


88 


Get Subunit 


012 


0A 


Initiate Seek 






Characteristics 


014 


OC 


Run 


213 


8B 


On-Line 


017 


OF 


Write Memory 


215 


8D 


Read Memory 


201 


81 


Change Mode 


216 


8E 


Recalibrate 


202 


82 


Mod Ctl Flags 


220 


90 


Topology 


204 


84 


Disconnect 


777 


FF 


Select Group 



2.10.2 Notes For RSX-11 Error Log Report (SDI Error Format) 

• Note 1 : The DEVICE MESSAGE is a unique RSX feature that gives a good description of the 
failure. 

• Note 2: The DEVICE and TYPE fields supply Logical drive address and drive type information. 

• Note 3: The PACK S/N and DRIVE S/N fields give the Pack/HDA serial number and drive 
serial number. 

• Note 4: DEVICE FUNCTION describes what operation was being performed when the error 
occurred. The TYPE OF ERROR field is the same information previously decoded in the DEVICE 
MESSAGE (see note 1). 

• Note 5: There are two message types: 

• Datagram message — These usually contain serious failure information. 

• Sequential message - These usually contain status information. 

• Note 6: A COMMAND REFERENCE NUMBER is given to each command the host issues 
the UDA. It is possible to have the same number in multiple error log entries if the command 
caused more than one error or if the UDA did retries. 

• Note 7 : The MESSAGE FORMAT defines the type of MSCP error log entry this entry decribes . 
It also specifies how the host is to interpret the various message fields (see Figure 2-8). 

• Note 8: See Table A-4 (Error Log Message Flags) in the appendix. 

• Note 9: The EVENT CODE gives a good description of a controller detected error. However, 
if the error is drive detected, the drive specific information in the SDI SUPPLIED STATUS FIELD 
will be the best source of information. 
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• Note 10: The UDA50 firmware version must be 003 or higher. 

• Note 1 1 : The Unit I. D. is the drive serial number. Note that RSX prints the serial number in HEX. 

• Note 12: The UNIT MODEL FIELD is the drive type, i.e. RA81, RA60, etc. 

• Note 13: In general, the UNIT CLASS is either a disk drive or a tape. However, the UDA50 
supports only disk drives. Therefore, for the UDA50, the UNIT CLASS has to be a disk class device. 

• Note 14: The UNIT SOFTWARE VERSION NUMBER FIELD is the drive microcode version. 

• Note 15: The UNIT HARDWARE VERSION NUMBER FIELD is the drive hardware version. 

• Note 16: N/A (reserved). 

• Note 17: Pack/HDA serial number. 

• Note 18: The HEADER VALUE FIELD is the Logical Block Number (LBN) of the sector being 
accessed when the error occurred. 

• Note 19: The SDI SUPPLIED STATUS FIELD represents the state of the drive when the error 
occurred (word 22 of Drive Status/Error). (See Table A-l in the appendix along with Figure 2-13.) 

• Note 20: More state of the drive information (word 23 of Drive Status/Error). 

• Note 21: STATUS BYTE 8 is the retry count for drive initiated retries. 

• Note 22 : STATUS BYTES 10 through 1 3 represent the drive detected SDI errors (see Figure 2-13). 

• Note 23: This is the best information for drive detected errors. Note that the error is listed as 
043 octal. The drive error code charts are published in hexadecimal, so it's easier to convert the 
octal number to hexadecimal and then look up the error in the relevant drive maintenance guide. 
In this case, 043 octal = 23 hexadecimal. 

• Note 24: This is the error code displayed on the front panel switches. 

2.10.3 Sample Printout Of RSX-11 Error Log Report (Disk Transfer Format) 

The UDA50 uses the DISK TRANSFER ERROR log format to report transfer (data) errors. The controller 
may attempt to retry the failing command, so multiple entries with the same Command Reference Number 
and possibily the same error may occur. 

NOTE 
This example assumes the previous example (RSX SDI 
error format) has been reviewed. All the fields in this 
example are not explained. The previous example con- 
tains additional information for fields that you need 
more information on. 



ENTRY 1138.4 SEQUENCE I . DU080 = 

D E U I C E M E S S AGE ( S I X S V M B0LECCER R R ) D D - Pi M M - Y V Y V H H : M M : S S 

SYSTEM I SENT I F I Cte"*" I ON : 



Tb ! t.i'l 



iiJLh ! 



.. i i hi _ d ; 



i hB 



i-'l J. A .-■ i 



: 'ft i 
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!!~! i ! MN 



<HULL LABEL > 



ir'i-r': :W 



!■•■! u R D S T R A N S F" E P P E D 
1 4 6 5 24672 



TLiNULK:: 



■ ■__ •.■• a ■_.■ i_ 



U H C 7 1 H I V P E OF E K R 



■ t. LKKU K r' 



\V\H \ iUPi 



N,-A N.--A N,-A H.--H 37 ? 541 
RSY- 1 1M,-M-PLUS ERROR LOGGING SYSTEM U2 . OS DD-MMM- YYYY HH : MM 

E N 7 R Y 103.4 < C N 7 1 N U E D > 
DEUICE SUPPLIED INFORMATION: 
UALUE I NTERPRETAT 1 OH 
00004000654 MESSAGE EHUELOPE FIELD 

[31 = 24] CONNECTION I . D . = MSCP DISK 

[23 : 20] MESSAGE TYPE = DATAGRAM MESSAGE 

E 19 = 16] CREDITS = 0. 

[15= 0] MESSAGE LENGTH = 22. WORDS 

1 1 1 35030454 COMMAND REFERENCE NUMBER F I ELD 
[31 ^ 0] COMMAND REFERENCE NUMBER 



MM MM MM 



0MMfifi0 



UNIT NUMBER FIELD 
[15= 0] UNIT NUMBER = 0. 

MESSAGE SEQUENCE NUMBER 

[15= 3 ERROR PACKET IS COMPLETE 
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RH.- 



HHH.4S.- 



m r ■- '- Ci r c r n p m & t f t it s t-i 

i il-j-jHul. r URi in i r iLLi' 

w.l 7 = 0] DISK TRANSFER ERROR 

FLAGS FIELD 

[ 7 J OPERATION SUCCESSFUL 

C O ] SEQUENCE NUMBER RESET (END OF LOGGED PACK 

EUENT CODE FIELD 

*[ 15 = 5] SUBCODE = SIX SYMBOL ECC ERROR 

xC 4= 0] MAJOR STATUS = DATA ERROR 



[NOTES] 



:no' 



441DAS2D1808 CONTROLLER I . D . CS/H) = 441DA52D1800 <X> 



HHh 



i"ifi 



r n K! Tpni i r p m n Ti it : nn p 
■_• ■ j 1 1 ! r-. •-• !— i_ l_ r-. 1 1 u i.' i_ i_ r J. i_ i_ i.' 

[ 7 : 0] MODEL = UDA50 

CONTROLLER CLASS FIELD 

[ 7 : 0J CLASS = MASS STORAGE CONT 






004 CONTROLLER SOFTWARE UERSION NUMBER FIELD 
[ 7 : 0] UDA50 FIRMWARE UERSI0H = 4 . 

00O CONTROLLER HARDWARE UERSION NUMBER FIELD 

[ 7 = 3 UDA50 HARDWARE UERSION = O . 

O O O M U L T I U H IT CODE F I E L D 

[7= 0] ACCESS PATH = y. 
[15= 12] SHARED SP I NDLE = 0. 

C C E 1 U H I T I . D . ( S ,- N > = C C E 1 O O 6 O O <■. X > 

005 UNIT MODEL FIELD 
[7=0] MODEL = R AS 1 



NOTE 4] 



fifi'P 



007 



004 



nfiyuyfi 



UNIT CLASS FIELD 

[7=0] CLASS = D I SK CLASS DEU I CE 

U H I T S F T W A R E U E R S 1 N H U M B E R F I E L D 
[7=0] FIRMWARE UERSION ==7. 

U N I T H A R D W ARE U E R S 1 N H U M E E R F I E L D 
[ 7 = 0] HARDWARE UERSION = 4 . 

RETRY LEUEL & COUNT FIELD 

[14= S ] RUNN I NG RETRY COUNT = O . 

[7=0] ERROR RECOUERY LEUEL = O . 
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gggggg i 5756, UOLUME SER I AL NUMBER F" I ELD C NOTE 5 ] 

[ 3 1 : 9 J UOLUME SER I AL NUMBER = 7150. 

00000 111245 HEADER UALUE F I ELD E NOTE 6 3 

[27 : 3 LBH= 111245 <0> 

2.10.4 Notes For RSX-11 Error Log Report (DISK TRANSFER Error Format) 

Note 1: For Transfer errors the block number (decimal) is given here. Note that it converts to 
the HEADER VALUE FIELD. 

Note 2: The MESSAGE FORMAT FIELD in this case, is a Disk Transfer Error Log Format. 

Note 3: The EVENT CODE always gives the UDA50 interpretation of the error. 

Note 4: The CONTROLLER SOFTWARE VERSION NUMBER must be 3 or higher. 

Note 5: Pack/HDA serial number. This is good to make note of on data transfer errors. 

Note 6: This is the LBN address of the error sector. This converts to 37541 Decimal which 
is the same number RSX indicated for the block number. (See Note 1 above.) 

2.10.5 Remaining Error Log Packet Formats 

The two remaining packet formats are the CONTROLLER ERROR MSCP PACKET FORMAT and the 
HOST MEMORY ACCESS ERROR PACKET FORMAT. These formats are not shown as examples, but 
the following section will highlight what is unique about them. 

2.10.5.1 Controller Error Packet Format - The UDA50 uses the controller error packet format to report 
controller errors to the host. This error log entry will look similar to the previous two examples, except 
it will be shorter than either. 

The first field to examine closely is the EVENT CODE FIELD because it is the first item that should indicate 
the problem. The list of various codes for this field can be found in the appendix Table A-5. 

If the event code is 12 (controller error), the field following the controller hardware version number should 
be examined. This field is then called the UDA50 INTERNAL ERROR CODE FIELD and can be decoded 
by looking at the appendix Table A-ll. 

2.10.5.2 Host Memory Access Packet Format - The UDA50 uses the Host Memory Access Error Log 
Packet Format to report host memory access problems. Although this entry will look very similar to the 
other examples, there are a couple of key items to look at closely: 



• 



The EVENT CODE FIELD should give a good indication of the kind of problem the UDA is 
experiencing. The event code should be looked up in appendix Table A-5. 

The last item displayed should be a field showing the Host Memory Address. This address is valuable 
since it may indicate what host memory address (or if there are several of these entries, what se- 
quence of addresses) the UDA is having trouble with. 
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2.11 INTRODUCTION TO SPEAR 

The SPEAR (Standard Package for Error Analysis and Reporting) program should be run if it is available 
on the operating system. SPEAR goes beyond the typical error log capabilities. It not only accumulates data, 
but it has the capability of analyzing and predicting which FRU is at fault. SPEAR is a library of functions 
that sorts, evaluates, and reports on events recorded in the local system event file. SPEAR is currently used 
on TOPS-10, TOPS-20, and VMS operating systems. Plans are underway to incorporate RAnn drives under 
its diagnostic analysis. 

SPEAR Reference Cards may be ordered from Printing and Circulation Services under part number 
(EK-SPEAR-RC). This reference card summarizes the SPEAR function codes, system event codes, and pro- 
vides other useful information. 

If SPEAR is not available to you, permission should be obtained from the operating system manager to run 
the subsystem diagnostics. This requires customers to give up their operating system temporarily. 

2.11.1 Faulty FRU Selection 

If SPEAR did not pick out the faulty FRU, then consult with the system manager to see if the suspect disk 
drive can be taken offline to run the drive-resident diagnostics. It is important to consult with the system 
manager first since the operating system may depend on the disk pack for a system image or back-up file. 

With the system manager's approval, place the suspect disk offline and run the drive-resident diagnostics. 

2.11.2 Replace The FRU Suggested By SPEAR 

Again, consult with the system manager before placing the disk drive offline. If the drive is already offline, 
there is no danger of bringing down the customer's operating system. Use the appropriate drive service manual 
to see how to replace the FRU. 

2.12 SUMMARY 

UDA50 problems generally appear in diagnostic printouts as SA register error codes. Drive problems appear 
in the error report without SA register error codes. Instead, the error report will print the real-time drive 
state code and the general status. When this happens, the drive maintenance guide or service manual should 
be consulted. These manuals give instructions on how to run the drive-specific diagnostics to identify the 
drive failure. 
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APPENDIX A 
TABLES 



Listed below are Tables A-l through A-ll: 
Table A-l Bit Description of Status Message Bytes 



Status Bit Description 

Byte 



Byte 1 Response Code Field — Byte 1 is the response code to a controller command. 

Byte 2 Unit Number - The unit number consists of two hexadecimal digits representing the unit 

number of the selected disk drive returning the status (0-254). 

Byte 3 Subunit Mask - The subunit mask is a four-bit representation of the subunit that is returning 

the status message. The right-most bit position represents subunit 0. The left-most bit position 
represents subunit 3. Only one bit can be set at a time. UDA50 subsystems can handle only 
drives that contain no subunits. Therefore, the only valid number in this status Bite is a hex- 
adecimal 1. Figure 2-4 shows the bit layout. For drives that contain no subunits (e.g. the 
RA80), the right-most bit position is always set to a 1 indicating subunit 0. 

Byte 3 High Unit Number - Byte 3 contains the upper four bits to a 12-bit (3 hexadecimal digits) 

unit number. 

Byte 4 OA - A binary one in this position indicates the drive is unavailable to the UDA50. A 

binary zero indicates the drive is available to the UDA50. 

Byte 4 RR - A binary one in this position indicates the drive requires an internal readjustment. 

Some drives do not use this bit. 

Byte 4 DR - A binary one in this position indicates there is a request for a diagnostic to be loaded 

in the drive microprocessor memory. A binary zero indicates that no diagnostic is being re- 
quested of the host system. 

Byte 4 SR - A binary one in this position indicates the drive spindle is up to speed. A binary zero 

indicates the drive spindle is not up to speed. 

Byte 4 EL — A binary one in this bit position indicates that there is loggable information in the ex- 

tended status area (Bytes 9-15). A binary zero indicates that no information is available in the 
extended status area. 

Byte 4 PS - A binary one in this bit position indicates the drive port select switch for this controller 

is pushed in (selected). A binary zero indicates the switch is out. 

Byte 4 RU - A binary one in this position indicates the RUN/STOP switch is pushed in (RUN). A 

binary zero indicates the switch is out (STOP). 
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Table A-l Bit Description of Status Message Bytes (Cont.) 



Status Bit Description 

Byte 



Byte 5 W4-W1 - Binary ones in any of these four bit positions represent the write-protect status for 

the sub-unit represented, (e.g., a 0001 indicates subunit within the selected drive is 
write-protected.) 

Byte 5 DD - A binary one in this bit position indicates the drive has been disabled by a controller 

error routine or diagnostic. The FAULT light is on when this bit is set. A binary indicates 
the drive was enabled by a controller error routine or diagnostic. 

Byte 5 FO - A binary one in this position indicates the drive can be formatted. 

Byte 5 DB - A binary one in this position indicates the diagnostic cylinders on the drive can be 

accessed. 

Byte 5 S7 - A binary one in this bit position indicates the 576 Byte sector format is selected. A 

binary zero indicates that the 512 Byte sector format is selected. The UDA50 does not sup- 
port 576 byte format. 

Byte 6 DE - A binary one in this position indicates a drive error has occurred and the drive 

FAULT lamp may be on. 

Byte 6 RE - A binary one in this position indicates an error occurred in the transmission of a com- 

mand between the drive and the UDA50. The error could be a checksum error or an incor- 
rectly formatted command string. 

Byte 6 PE — A binary one in this position indicates improper command codes or parameters were 

issued to the drive. 

Byte 6 DF — A binary one in this postion indicates a failure in the initialization routine of the drive. 

Byte 6 WE — A binary one in this position indicates a write lock error has occurred. 

Byte 7 S4-S1 - This is a four-bit representation of the sub-units that have their attention available 

messages suppressed in the UDA50. The right-most bit position represents sub-unit 0. The 
left-most bit position represents sub-unit 3. If one of the bits is set, it indicates the controller 
is not to interrupt the host CPU with an attention available message when the specified sub- 
unit raises its available real-time drive status line to the UDA50. The S4-S1 bits reflect the 
result of a change controller flags command where attention-available messages are not 
desired for certain sub-units. 

Byte 7 C1-C4 - This is a four-bit drive status code indicating various states of drive operation. At 

the present time only three codes are valid. A code of 0000 = drive normal operation. A 
code of 1000 = the drive is offline due to being under control of a diagnostic. A code of 
1001 = the drive is offline due to another drive having the same unit identifier (e.g. serial 
number, drive type, class etc.). 
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Table A-l Bit Description of Status Message Bytes (Cont.) 



Status Bit Description 

Byte 



Byte 8 RETRY COUNT/FAILURE CODE - This 8-bit Byte contains one of two types of informa- 

tion depending upon the status of the DF bit (Byte 6). The DF bit monitors the drive 
initialization process. The DF bit remains a zero if initialization is successful. In this case, 
Byte 8 contains the retry count from the previous operation, i.e., a seek operation required 
fourteen retries to be successful. If a get status command is initiated, Byte 8 contains the 
number 14. 

The DF bit being set indicates the drive initialization failed, and Byte 8 now contains a 
specific drive error code. This error code can be looked up in the appropriate drive service 
manual. 



Table A-2 Subsystem Diagnostic Error Code List 



Decimal Description 

Error 

Number 



EVRLA Host Error Messages 

EVRLA Initialization Error Message s 

1 Failed GETBUF routine 

2 Error trying to read DM data file 

3 Invalid program name found 

4 Invalid DM program version 

5 Failed RELBUF routine 

6 Failed GETBUF routine 

7 Error trying to read DM data file 

8 Error trying to read DM data file 

9 Failed to read P table 

10 Invalid controller encountered 

1 1 Invalid controller encountered 

12 Invalid UBA/UBI encountered 

13 Selected devices are on multiple UNIBUS adapters 

14 Duplicate controller address found 

15 Duplicate controller vectors found 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



16 

17 
18 
19 
20 
21 
22 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 



Failed to initialize channel adapter 
Failed to clear UBA/UBI status 
Failed to initialize device bus (UBA/UBI) 
Failed to clear UNIBUS status 
Not enough memory to test units 
Failed GETBUF routine 
Failed SETMAP routine 
EVRLA Test 1 Host Error Messages 

Failed to initialize device bus (UBA/UBI) 

Failed to clear UBA/UBI status 

Failed while checking UBA/UBI status 

Error trying to address UDAIP 

Failed while checking UBA/UBI status 

Error trying to address UDASA 

Failed to initialize device bus (UBA/UBI) 

Failed to clear UBA/UBI status 

Step bit did not set UDASA register during initialization 

UDA resident diagnostic detected a failure 

UDASA register failed to change during port loop diagnostic 

Data comparison error during port loop test diagnostic 

UDASA register failed to change during port loop diagnostic 

Data comparison error during port loop test diagnostic 

Failed to initialize device bus (UBA/UBI) 

Failed to clear UBA/UBI status 

Step bit did not set UDASA register during initialization 

UDA resident diagnostic detected a failure 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



19 
20 

21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 

1 
2 
3 
4 
5 



Channel services interrupt enable failure 
Channel services interrupt disable failure 
UDA failed to interrupt 
Unexpected interrupt encountered 
Unknown interrupt encountered 

Expected/received bus request (BR) levels do not match 
Expected/received vectors do not match 
Failed to initialize device bus (UBA/UBI) 
Failed to clear UBA/UBI status 
Failed to initialize device bus (UBA/UBI) 
Failed to clear UBA/UBI status 
Failed to initialize device bus (UBA/UBI) 
Failed to clear UBA/UBI status 
Channel services interrupt enable failure 
Channel services interrupt disable failure 
EVRLA Test 2 Host Error Messages 
Failed setmap routine 
Failed to initialize device bus 
Failed to clear UBA/UBI status 
Channel services interrupt enable failure 
Channel services interrupt disable failure 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



EVRLA Test 3 Host Error Messages 

1 Failed setmap routine 

2 Failed to initialize device bus (UBA/UBI) 

3 Failed to clear UBA/UBI status 

4 Channel services interrupt enable failure 

5 Channel services interrupt disable failure 
EVRLA Test 4 Host Error Messages 

1 Failed setmap routine 

2 Failed to initialize device bus (UBA/UBI) 

3 Failed to clear UBA/UBI status 

4 Channel services interrupt enable failure 

5 Channel services interrupt disable failure 
EVRLA UDA Initialization error messages 

100 Not enough free memory to test units selected 

101 Step bit did not set in UDASA register during initialization 

102 UDA resident diagnostics detected a failure 

103 UDA did not return correct data in UDASA register during initialization 

104 UDASA register did not go to zero after step 3 write of initialization 

105 Step bit did not set in UDASA register during initialization 

106 UDA resident diagnostics detected a failure 

107 UDA did not return correct data in UDASA register during initialization 

108 UDA did not clear ring structure in host memory 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



EVRLA PUP Protocol Errors 

200 Diagnostic machine (DM) program asked for data from unknown drive 

300 Response packet from UDA does not contain expected data 

401 Microcode reported M7485, M7486 that did not match get status response 

402 Microcode reported unknown controller model 

403 Response packet from UDA does not contain expected data 

500 UDA reported a fatal error while loading DM program 

501 UDA failed to interrupt 

502 UDA reported a fatal error while waiting for get status response 

503 UDA failed to interrupt 

601 Unknown request received from DM program 

602 DM program asked for data from unknown drive 

603 Same as 602 

604 Same as 602 

605 Same as 602 

606 Same as 602 

607 Same as 602 

608 Same as 602 

609 Same as 602 

610 No interrupt received from DM program 

611 Fatal error while running DM program 

612 Failed RELBUF routine 

700 Failed GETBUF routine 

701 Failed SETMAP routine 
750 Failed GETBUF routine 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal Description 

Error 

Number 



EVRLA Interrupt Handler Error Message s 

800 Unknown interrupt encountered 

801 Same as 800 

802 Unexpected interrupt encountered 

CZUDC Host Error Messages 

00001 The program does not like the way you answered the hardware questions, or the UDA50 was 
given more than 1 vector, or BR level, or burst rate 

00002 The program does not like the way you answered the hardware questions, two units select the 
same drive 

00003 The program does not like the way you answered the hardware question, more than eight 
drives selected on UDA 

00004 Not enough room in memory to test the units selected 

00005 Checksum error in DM program file 

00006 Table inconsistency error, reload program 

00007 Error in DM program file, DM program not found 

00008 Two UDAs use the same vector 

00009 Illegal configuration for test 4 

00010 Wrong APT diagnostic is being used with this controller, use CIUDX 
00013 Microcode reports controller model that did not match get status response 

00020 Memory error trying to read UDA registers, check UNIBUS selection switches on UDA 
module M7485 or UNIBUS 

00021 UDA resident diagnostic failure, replace M7486 

00022 Step bit did not set in UDASA register, replace M7485 

00023 UDA did not clear ring structure in host memory during initialization, replace M7485 

00024 UDASA register did not go to zero after step 3 write of initialization, suspect either M7485 
or the UNIBUS 

00025 UDA did not return correct UDASA register information, replace M7485 

00026 Data compare error port loop test, replace M7485 



A-8 



Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal Description 

Error 

Number 



00027 Could not write to UDASA register, replace M7485 

00028 UDA did not interrupt the host, replace M7485 

00029 UDA interrupted at different BR level, replace M7485 

00030 UDA reported error in UDASA register list of UDASA codes 

00031 Assume DM program hung 

00032 Unknown request number in DM message buffer, suspect UNIBUS UDA or corrupted DM 
program 

00033 Response packet from UDA does not contain expected data 

00036 No interrupt received from UDA for 30 seconds 

00037 UDA reports error in UDASA register 

00038 Memory error trying to read UDASA register, check UNIBUS select switches on M7486, or 
UNIBUS, or replace M7485 

Common EVRLA and CZUDC DM Program Error Messages 

TEST 1 DM Error Messages 

01000 A nonexistent host memory error 

01001 Parity error on read from UNIBUS 

01002 Memory location did not contain own address 

01003 Nonexistent memory error trying to read from UNIBUS buffer 

01004 Parity error on read from UNIBUS within buffer 

01005 Data compare failed after read then write from UNIBUS 

01006 UNIBUS addressing error 
Test 2 DM Error Messages 

02000 Host specified unit number can not be found 

02001 Cannot receive valid drive state, check drive power 

02002 Drive state received with bad parity 

02003 Drive not asserting receiver ready 

02004 Timeout on send of echo command to drive 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



02005 
02006 
02007 
02008 
02009 
02010 
02011 
02012 
02013 
02014 
02015 
02016 
02017 
02018 
02019 
02020 
02021 
02022 
02023 
02024 
02025 
02026 
02027 
02028 
02029 
02030 



Echo during receive of echo response from drive 

Echo command responded with different data 

Error bit set in get status response after drive clear 

Timeout on send of online command to drive 

Error during receive of online response from drive 

Online command was unsuccessful 

Online command did not return expected response code 

Timeout on send of get unit characteristics command to drive 

Error during receive of get unit characteristics command 

Get unit characteristics command was unsuccessful 

Get unit characteristics command did not return expected response code 

Host program gave DM code improper data 

Timeout on send of diagnose command to drive 

Error during receive of diagnose command response from drive 

Diagnose command was unsuccessful 

Diagnose command did not return expected response code 

Drive diagnostic reports a hard error 

Host program downline loaded a diagnostic with zero byte count 

Diagnostic requested by drive could not be supplied by host 

Timeout on send of memory read command to drive 

Error during receive of memory read response from drive 

Memory read command was unsuccessful 

Memory read command did not return expected response code 

Timeout on send of memory write command to drive 

Error during receive of memory write command response from drive 

Memory write command was unsuccessful 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



02031 
02032 
02033 
02034 
02035 
02036 
02037 
02038 
02039 
02040 
02041 
02042 
02043 
02044 
02045 
02046 
02047 
05000 

03001 
03002 
03003 
03004 
03005 
03006 
03007 



Memory write command did not return expected response code 
Timeout on send of run command to drive 
Error during receive of run command response from drive 
Run command was unsuccessful 
Run command did not return expected response code 
Timeout on send of recalibrate command to drive 
Error during receive of recalibrate response from drive 
Recalibrate command was unsuccessful 
Recalibrate command did not return expected response code 
Timeout on send of get status command to drive 
Error during receive of get status response from drive 
Get status command was unsuccessful 
Get status command did not return expected response code 
Timeout on send of drive clear command to drive 
Error during receive of drive clear command from drive 
Drive clear command was unsuccessful 
Drive clear command did not return expected response code 
Unable to find requested drive for testing 
Test 3 DM Error Messages 

Timeout on send of a level 2 command 

Timeout of receive on get common characteristics command 

First word received was not a start frame 

Framing error on level response 

Checksum error on level response 

Response longer than expected 

Code received from subsystem unintelligible 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



03008 
03009 
03010 
03011 
03012 
03013 
03014 
03015 
03016 
03017 
03018 
03019 
03020 
03022 
03023 
03024 
03025 
03026 
03027 
03028 
03029 
03030 
03031 
03032 
03033 
03034 



Command did not return expected response code 

Drive not asserting receiver ready in drive state 

Failed to receive valid drive state 

Can not receive drive state from drive check power 

Drive state received with bad parity 

No valid state from drive 

Subunit characteristics say there are zero read only groups in diagnostic area 

Subunit characteristics say less than 1 read/write groups in diagnostic area 

Neither read/write ready nor attention set after recalibration command 

Subunit characteristics say less than 1 diagnostic cylinder 

Read/write ready dropped before format operation 

Format operation reported timeout failure 

After recalibration, error bits were set 

Read/write ready dropped before write operation 

Could not read or write any block on this track, write operation failure 

Read/write ready dropped before read operation 

Could not read or write any block on track, read operation failure 

Could not read or write any block on track, data compare word failure 

Seek complete time-out, read/ write ready did not set 

No block on this track can be read, last block tried 

Available was not asserted after disconnect 

Invalid command [aaaa] was successful 

Command with type length = A was successful 

Unit did not report transmission error 

Unit accepted an invalid group number from group select level 1 

Unable to correctly read overlay 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



03035 
05000 

04001 
04002 
04003 
04004 
04005 
04006 
04007 
04008 
04009 
04010 
04011 
04012 
04013 
04014 
04015 
04016 
04017 
04018 
04019 
04020 
04021 
04023 
04024 



Successfully wrote in DBN area while drive was write protected 
Unable to find requested drive for testing 
Test 4 DM Error Messages 

Attention asserted during seek error or loggable information 

Attention asserted unexpectedly, asynchronous drive error or log 

Seek did not complete, neither attention nor read/ write ready asserted 

RCT area corrupted, could not find replacement for RCT LBN 

Header not found during write 

Select track and write level 1 command not executed 

ECC detected error 

ECC detected error, but correction failed 

ECC corrections exceeded threshold 

ECC correction succeeded, but EDC detects error 

Error recovery tried all levels without success 

Data comparison failed, whether detected by ECC or EDC or not 

Drive not on line to UDA and not spinable 

Unable to complete seek, tried three times 

Seek required [nnn] retries before completing 

Errors during drive initialization and setup 

No valid states from drive, no drive clocks 

Attempt to write on write protected drive error code from UDA 

Header not found during read 

Select track and read level 1 command not executed 

Drive not formatted in 512 byte mode 

Unable to continue testing, port switch out, or run/stop switch out, or spindle dropped ready 

EDC detected error, but ECC did not 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



Decimal 

Error 

Number 



Description 



04025 
04026 
04028 
04033 
04034 
04035 
04036 
04037 
04038 
04040 
04041 
04042 
04044 
04045 
04047 
04048 
04049 
04050 
04051 
04052 
04053 
04054 
04055 
04056 
04057 
04058 



Write attempted maximum times 

Read attempted maximum times 

Both read only and write only bits set-host error 

Unable to correctly read overlay 

SERDES overrun error during read 

Data or state clock timeout during read 

Data synchronization timeout during read 

Read/write ready dropped during read 

Receiver ready dropped during read 

All copies of RCT read with errors, LBN with header not found 

Could not find replacement for LBN that was revectored 

Time-out waiting for sector or index pulse 

Seek or head select error detected during write 

Seek or head select error detected during read 

Data or state clock timeout during write 

Read/write ready dropped during write 

Receiver ready dropped during write 

Operator error-beginning block number greater than ending number 

Operator error, the begin/end sets overlap 

Operator error, begin/end block number exceeds maximum 

Operator error, duplicate bad blocks 

Operator error, bad block number exceeds maximum 

Operator error, start cylinder greater than ending cylinder 

Operator error, random and sequential seek cannot be mixed 

Operator error, overflow calculating the LBN/DBN from cylinder 

Operator error, track exceeds maximum for device, or group exceeds maximum for device. 
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Table A-2 Subsystem Diagnostic Error Code List (Cont.) 



.L/cClITlJlI 

Error 
Number 



Description 



04059 
04062 
04063 
04064 
04068 
04069 
04070 
04071 
04072 
04073 
04074 
04075 
04076 
04077 
04078 
05000 



Operator error, two identical tracks or groups 

Operator error, cylinder too large DBN/LBN exceeds maximum 

Real-time state received error during write 

Real-time state received error during read 

Unknown error code during write 

Unknown error code during read 

Timeout of send 

Timeout of receive 

First word received was not a start frame 

Framing error on level receive 

Checksum error on level receive 

Buffer size smaller than receive 

Response level 2 command not as expected 

Drive never deasserted receiver ready after send 

Unknown error code returned from level 2 receive 

Unable to find requested drive for testing 



Table A-3 Error Log Event Format Codes 



Format 


Format 


Format 


code 


Code 


Code 


(Dec.) 


(Octal) 


(Hex) 











1 


1 


1 


2 


2 


2 


3 


3 


3 



Format Description 



Controller errors 
Host memory access errors 
Disk transfer errors 
SDI errors 
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Table A-4 Error Log Message Flags 



Bit set 
in high 
byte of 
word 4 



Octal 



Hex 



Error Message Flag Description 



200 
100 
1 



80 
40 
1 



Operation successful flag 
Operation continuing flag 
Sequence number reset flag 



Table A-5 Error Log Status Event Codes 



Hex 
Code 



Octal 
Code 



Description 



The first list in this table is a group of codes that determine the major status or event being reported such as a media 
format error or a drive error, etc. Within these major categories are more specialized subcodes that break down the 
major category further. For example, if a hex code of (B) is a drive error, a hex code of (AB) reveals there is also 
a drive clock dropout. A separate list is given for each of the subcode values. 









Success 


1 


1 


Invalid command 


2 


2 


Command aborted 


3 


3 


Unit offline 


4 


4 


Unit available 


5 


5 


Media format error 


6 


6 


Write protected 


7 


7 


Compare error 


8 


10 


Data error 


9 


11 


Host buffer access error 


A 


12 


Controller error 


B 


13 


Drive error 


IF 


37 


Status/event code mask 



Success Subcode (Hex 0) 



20 


40 


Spindown ignored 


40 


100 


Still connected 


80 


200 


Duplicate unit number 


100 


400 


Already online 


200 


1000 


Still online 



Invalid Command Subcode (Hex 1) 

1 1 Invalid message length 

Command Aborted Subcode (Hex 2) NOT USED 

Unit Offline Subcode (Hex 3) 

3 3 Unit unknown or online to another controller 

23 43 No volume mounted or drive disabled via RUN/STOP switch 
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Table A-5 Error Log Status Event Codes (Cont.) 



Hex Octal Description 

Code Code 



C5 


305 


105 


405 


125 


445 



43 103 Unit inoperative - For SDI drives, the controller has marked the drive in- 

operative due to an unrecoverable error in a previous level 2 exchange, or the 
drive has a duplicate unit identifier. 

83 203 Duplicate unit number 

103 403 Unit disabled by field service or diagnostic. For SDI drives, the DD bit is set. 

Unit Available Subcode (Hex 4) NOT USED 

Media Format Error Subcode (Hex 5) 

A5 245 Format mismatch — Disk is not formatted with 512 byte sectors. The disk's 

FCT indicates it is formatted with 576 byte sectors, and either the controller or 
the drive only supports 512 byte sectors. 

FCT corrupted - Disk is not formatted or the FCT is corrupted. 

RCT corrupted - The RCT search algorithm encounters an invalid RCT enter. 

No replacement block available 

Write-Protected Subcode (Hex 6) 

1006 10006 Unit is software write protected. 

2006 20006 Unit is hardware write protected. 

Compare Error Subcode (Hex 7) NOT USED 

Data Error Subcode (Hex 8) 

8 10 Sector written with "Force Error" modifier. 

48 110 Invalid header - The subsystem reads an invalid or inconsistent header for the 

requested sector. Causes of an invalid header include header mis-sync, header 
sync time-out, or an inconsistent header. 

68 150 Data sync time-out - Data sync is not found. 

88 210 Correctable error in ECC field - A transfer encounters a correctable error in 

which only the ECC field is affected. 

E8 350 Uncorrectable ECC error - A transfer encounters an ECC error that exceeds 

the correction capability of the subsystem's error correction algorithm. 



108 


410 


One symbol ECC error 


128 


450 


Two symbol ECC error 


148 


510 


Three symbol ECC error 
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Table A-5 Error Log Status Event Codes (Cont.) 



Hex Octal Description 

Code Code 



Four symbol ECC error 

Five symbol ECC error 

Six symbol ECC error 

Seven symbol ECC error 

Eight symbol ECC error - A transfer encounters a correctable ECC error with 
the specified number of ECC symbols in error. The number of symbols in er- 
ror corresponds to the severity of the error. 

Host Buffer Access Error Subcode (Hex 9) 

9 1 1 Host buffer access error - The controller is unable to access a host buffer to 

perform a transfer and has no visibility into the cause of the error. 

29 51 Odd transfer address 



168 


550 


188 


610 


1A8 


650 


1C8 


710 


1E8 


750 



49 


111 


Odd byte count 


69 


151 


Nonexistent memory error 


89 


211 


Host memory parity error 



Controller Error Subcode (Hex A) 

A 12 Reserved for host command timeout expired 

2A 52 SERDES overrun or underrun error - Either the drive is too fast for the con- 

troller, or a controller hardware fault has prevented the controller microcode 
from being able to keep up with the data transfer to or from the drive. 

4 A 112 EDC error - The sector is read with correct or correctable ECC and an in- 

valid EDC. There is most likely a fault in the ECC logic of this controller or 
the controller that last wrote the sector. 

6A 152 Inconsistent internal control structure - Some high level check detects an in- 

consistent data structure. For example, a reserved field contains a nonzero 
value, or the value in a field is outside its valid range. This error usually im- 
plies the existence of a microcode bug. 

8 A 212 Internal EDC error - Some low level check detects an inconsistent data struc- 

ture. For example, a microcode implemented checksum or vertical parity (hard- 
ware parity is horizontal) associated with internal sector data is inconsistent. 
This error usually implies a fault in the memory addressing logic of one or 
more of the controller's processing elements. It may also result from a double 
bit error or other error that exceeds the error detection capability of the con- 
troller's hardware memory checking circuitry. 

10A 412 Data bus overrun - The controller attempts to perform too many concurrent 

transfers, causing one or more of them to fail due to a data overrun or 
underrun. 
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Table A-5 Error Log Status Event Codes (Cont.) 



Hex 
Code 



Octal 
Code 



Description 



12A 



452 



14A 512 

16A 552 

Drive Error Subcode (Hex B) 
2B 53 



4B 



6B 



8B 



113 



153 



213 



AB 



CB 



EB 



10B 



12B 



253 



313 



353 



413 



453 



Data memory error — The controller detects an error in an internal memory, 
such as a parity error or a nonresponding address. This subcode only applies to 
errors not reported via MSCP. These errors do not affect the controller's abili- 
ty to properly generate end and error log messages. For most controllers, this 
subcode is only returned for controller memory errors in data or buffer 
memory and noncritical control structures. If the controller has several such 
memories, the specific memory involved is reported as part of the error ad- 
dress in the error log message. 

PLI reception buffer parity error (N/A for UDA) 

PLI transmission buffer parity error (N/A for UDA) 



Drive command timeout - For SDI drives, the controller's timeout expires for 
either a level two exchange or the assertion of read/write ready after an initiate 
seek. 

Controller-detected transmission error - For SDI drives, the controller detects 
an invalid framing code or a checksum error in a level two response from the 
drive. The UDA50 also returns this subcode for controller detected protocol er- 
rors. All other SDI controllers return subcode 9 for protocol errors. 

Positioner error (mis-seek) — The drive reports a seek operation is successful, 
but the controller has determined the drive has positioned itself to an incorrect 
cylinder. 

Lost read/ write ready during or between transfers - For SDI drives, 
read/write ready is negated when the controller attempts to initiate a transfer or 
at the completion of a transfer. Read/ write ready is previously asserted in- 
dicating the completion of the previous seek. This usually results from a drive 
detected transfer error, in which case an additional error log message may be 
generated containing the "drive detected error" subcode. 

Drive clock dropout - For SDI drives, either data clock or state clock is miss- 
ing when it should be present. This is usually detected by a timeout. 

Lost receiver ready for transfer - For SDI drives, receiver ready is negated 
when the controller attempts to initiate a transfer or does not assert at the com- 
pletion of a transfer. This includes all cases of the controller's time-out expir- 
ing for a transfer operation (level .one real-time command). 

Drive-detected error - For SDI drives, the controller receives a get status or 
unsuccessful response with the EL flag set. The controller may also receive 
this response with the DR flag set. It does not support automatic diagnosis for 
that drive type. 

Controller-detected pulse or data parity error - For SDI drives, the controller 
detects a pulse error on either the state or data line, or the controller detects a 
parity error in a state frame. 

Drive-requested error log (EL bit set) 
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Table A-5 Error Log Status Event Codes (Cont.) 



Hex 
Code 



Octal 
Code 



Description 



14B 



16B 



18B 



1AB 



1CB 



513 



553 



613 



653 



713 



Response length or opcode error - For SDI drives, a level two response from 
the drive has an invalid opcode, an improper length, or is not a possible 
response in the context of the exchange. 

Clock resumption fails after initialization - For SDI drives, the drive clock 
does not start after a controller attempt to initialize the drive. 

Clock persists after initialization - For SDI drives, the drive clock continues 
beyond drive initialization. 

Receiver-ready collision - For SDI drives, the controller attempts to assert its 
receiver ready (to receive a response) and the drive's receiver ready is still 
asserted (to receive a command). 

Response overflow 



Table A-6 Controller Class Values 



Class 

Byte 

(Decimal) 



Subsystem Type 



Reserved 

Mass storage controllers 

Disk class device - DEC Standard 166 format 

Tape class device 

Disk class device - DEC Standard 144 format 



Table A-7 Controller Model Values 



Model 

Byte 

(Decimal) 



Controller Type 






Reserved 


1 


HSC50 


4 


VMS MSCP server 


5 


TU81 


6 


UDA50 
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Table A-8 Drive Model Number Values 



Model 

Byte 

(Decimal) 



Device Model 



RA80 fixed media disk drive 
RA60 removable media disk drive 
RA81 fixed media disk drive 



Table A-9 MSCP Error Codes 



Octal 
Code 



Definition 



1 

2 

3 

4 

5 

6 

7 

10 

12 



Error is logged by the bad block replacement module 

Driver is sending a command at the time of the error 

Driver can not find a free command packet 

Driver determined the unit is hung 

Disk unit size is too big (over pack cluster size 16) 

Controller is offline 

Unit is not functional 

Command timed out 

Data error during read/write command 



Table A-10 Status Code of the MSCP Packet 



Packet Code 



MSCP Packet Status 



Success 

Invalid command 
Command aborted 
Unit Off-line 
Unit available 
Media format error 
Write protected 

Compare error 

Data error 



The command or retry of a failed command is unsuccessfully completed. 

An invalid command or command parameters are received by the controller. 

The controller aborts a command in progress. 

The unit identified in the "unit number" field is in the off-line state. 

The unit identified in the "unit number" field is in the available state. 

The pack or HDA mounted in the drive appears to be formatted incorrectly. 

A command requiring a write operation is attempted on a write-protected 
unit. 

A compare host data command finds a difference in the data that is written 
and the data in host memory like a write check command. 

Invalid or uncorrectable data is obtained from the drive. 
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Table A-10 Status Code of the MSCP Packet (Cont.) 



Packet Code 



MSCP Packet Status 



Host buffer access buffer 

Controller error 
Drive error 



The controller encounters an error, like UNIBUS timeout, when trying to 
access host memory. 

The controller encounters an internal controller error. 

The controller discovers an error within a drive. Such errors are usually 
mechanical in nature since they are reported as data errors. 



Table A-ll UDA50 Internal Error Code 



Internal 
Error Code 
(HEX) 



Description 



1 



B 



ER.PRD - UNIBUS packet read error. This error can occur when the 
U.RECV routine fails to get the MSCP packet from host memory. 

ER.PWR - UNIBUS packet write error. This error is set if the UNIBUS 
write (UNB-WR) times out while attempting to send a packet to the host. 

ER.RRP - UDA50 ROM and RAM parity error. 

ER.RAP - UDA50 RAM parity error. This error can be corrected by 
changing UDA50 module M7486. 

ER.ROP - UDA50 ROM parity error. This error can be corrected by 
changing UDA50 module M7485. 

ER.RRD - UNIBUS ring read error. Set if the UNIBUS read times out 
while reading the host command or response ring. 

ER.RWR - UNIBUS ring write error. Set if the UNIBUS write to update 
the command or response ring descriptor fails. 

ER.INT - UNIBUS interrupt master failure. Set if the UDA50 fails to 
become interrupt master. This is a long timeout. 

ER.HTO - Host access timeout error. This error occurs if the host timer is 
not reset and times out. The timer is reset by the U.SEND, U.RECV, or 
U.BFSV routines. 

ER.NIM - Host exceeded command limit. Set if a nonimmediate command 
is loaded in LOG PKT space. 

ER.MST - Bus master error. Set if the UDA50 fails to become bus master 
during the start up of a UNIBUS read or a UNIBUS write 
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Table A-ll UDA50 Internal Error Code (Cont.) 



jLincrHtfii 
Error Code 
(HEX) 



Description 



17 



ER.DMX - Diagnostic mode XFC error. 

ER.VCI - Invalid virtual circuit identifier. The UDA50 is trying to execute 
an MSCP command in the wrong mode. 

ER.IWR - Interrupt write error on UNIBUS. Set if the UDA cannot write 
to the host and set the Response ring not zero. 

ER.SUN - Too many subunits on the UDA50. 
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